Building Scalable, Flexible Data Pipelines for Big Data

Date

Monday, February 24, 2014 - 6:30pm - 8:30pm

Venue

eBay Whitman Campus

eBay Whitman Campus
2065 Hamilton Ave
San Jose, CA
Speaker: 
Vivek Ganesan

Event Details

Data Science meeting (Formerly Data Mining)

This presentation will be an overview of ETL tasks and tools in Hadoop and will cover the pros/cons of different approaches.

ETL is Extransform/Load, in other words, it is the process of ingesting data from external sources into Hadoop/HDFS.

Speaker Bio

Vivek has worked on big data and cloud deployments at large companies such as Intuit and Paypal, and also in startups.  Currently he provides expert consulting services to Fortune 500 clients on Big Data projects. 

Event page provided by ACM