Loading...

{{notif_text}}

SocialHelpouts is now CutShort! Read about it here
Who pays how much? Be informed with this salary report on Indian startups.
Why join channels?
Learn from peers
Discuss and share learning resources with the top professionals across the world
Open business or job opportunities
Earn reputation points to get consulting projects, attract talent or land jobs.
Accelerate your growth
Grow your network and get exclusive deals from our learning partners.
signup now
Neelesh Parulkar asked a question

How can R and Hadoop be used together?

 

answer
submitting answer...
submit
No answers yet. Be the first one to answer!
1 answer
VISHNU SUBRAMANIAN Deep learning researcher.
Hadoop is an ecosystem of various components. Some of the components you may be interested to use from R could be SQL(Hive , Impala ,Spark SQL) or for datascience activities. 

SQL : An example scenario could be where you need to pull data from the underlying File system , which in Hadoop is HDFS. One simple way is to use a thrift server which exposes the data like how any database would do. 

DataScience tasks : You can use SparkR for building data pipeline like pulling data from multiple sources , cleaning the data , applying distribnuted Ml .

All these tools are best used based on the problem you are trying to solve . 

Thanks,
Vishnu Subramanian
Loading comments...
To view all answers to this question, join this channel
join this channel
Awesome! You have connected your Facebook account. Like us on Facebook to stay updated.