Data Science meeting (Formerly Data Mining)
This presentation will be an overview of ETL tasks and tools in Hadoop and will cover the pros/cons of different approaches.
ETL is Extransform/Load, in other words, it is the process of ingesting data from external sources into Hadoop/HDFS.
Vivek has worked on big data and cloud deployments at large companies such as Intuit and Paypal, and also in startups. Currently he provides expert consulting services to Fortune 500 clients on Big Data projects.
Event page provided by ACM