Building Scalable, Flexible Data Pipelines for Big Data


Monday, February 24, 2014 - 6:30pm - 8:30pm


eBay Whitman Campus

2065 Hamilton Ave
San Jose, CA

*** Bring ID (e.g. Driver's License) for eBay Security ***

*** Please arrive by 7 PM due to Security ***

Vivek Ganesan

Event Details

Data Science meeting (Formerly Data Mining)

This presentation will be an overview of ETL tasks and tools in Hadoop and will cover the pros/cons of different approaches.

ETL stands for Extract/Transform/Load; in this context, it is the process of ingesting data from external sources into Hadoop/HDFS.

Speaker Bio

Vivek has worked on big data and cloud deployments at large companies such as Intuit and PayPal, as well as at startups. Currently, he provides expert consulting services to Fortune 500 clients on big data projects.

Event page provided by ACM