Hadoop

Ask Big Data Hadoop Related Questions

1 min read

[vc_row][vc_column][vc_text_separator title=”Ask Any Hadoop and it’s Ecosystem Related Questions” i_icon_fontawesome=”fa fa-question” i_color=”sandy_brown” color=”sky” border_width=”2″ add_icon=”true”][/vc_column][/vc_row][vc_row][vc_column][vc_message]We are creating our forum till the time it is not live, you can post all your questions and doubts here. We will make sure it is getting answered correctly.

All the Hadoop and BI related questions are highly welcomed. You can also post job-related queries.[/vc_message][/vc_column][/vc_row]

7 Comments

Sumit Kumar says:

March 6, 2017 at 4:57 am

Thanks for this website.

Could you please provide me the code or way to schedule Sqoop job daily that load incremental import into one hive table.

It would be great help if I will get step by step solution.

Thanks,
Sumit kumar

Reply
- HDFS Tutorial Team says:
  
  March 6, 2017 at 5:00 am
  
  Hi Sumit,
  
  The incremental load is easy and you can check the following link where we have explained how to do incremental import in Sqoop.
  
  Link: http://hdfstutorial.com/sqoop-import-function/
  
  And for job scheduling in Sqoop, you can check the following tutorial-
  
  http://hdfstutorial.com/sqoop-jobs/
  
  Do try to implement these and let us for any further doubts.
  
  Have any doubts? Please comment here.
  
  Regards,
  HDFS Tutorial Team
  
  Reply
Jerry says:

April 19, 2017 at 9:00 pm

customers = load ‘./in2/customersTable.txt’ using PigStorage(‘ ‘) as (nameCus:chararray, age: int);
purchases = load ‘./in2/purchasesTable.txt’ using PigStorage(‘ ‘) as (namePur:chararray, flavor: chararray);
A = JOIN customers BY (name), purchases BY (name);
B = foreach A generate A::nameCus,A::namePur,A::age,A::flavor;
C = group B by flavor;
D = foreach C generate COUNT(C) as purchaseCount;
E = ORDER D BY purchaseCount DESC;
F = LIMIT E 1;
G = store F into ‘./flavorcount’;

Reply
- HDFS Tutorial Team says:
  
  May 29, 2017 at 1:31 pm
  
  HI,
  
  Are you getting any issues here?
  
  Reply
spoorthy says:

April 20, 2017 at 7:24 am

hai
I installed sqoop as per ur guidelines.when I am using sqoop list-databases command,i am getting an error as could not find or load main class org.apache.sqoop.Sqoop. how to resolve this issue?

Reply
- HDFS Tutorial Team says:
  
  May 29, 2017 at 1:30 pm
  
  Hi,
  
  You need to make sure that you have sqoop-1.4.3.jar under your SQOOP HOME directory.
  
  Also, check the sqoop version you have downloaded. Please try and comment for any other issues.
  
  Reply
gwgreen1 says:

August 11, 2017 at 1:27 pm

I know this is off topic from Hadoop. But somewhat related to Big Data. My question is
Why is a spark RDD immutable?
I know Immutable means cannot change. What is the need for rdd to be immutable and what are the advantages of it?

Reply

You may also like

7 Comments

Leave a Comment X