Date
Venue
Cloud Center
Event Details
We had a great kickoff event on August 18. The Hackathon continues until our Data Mining Camp on Saturday, October 13 in San Jose
COMPETITION: Which person or team can develop the best prediction of the product on the Best Buy mobile web site a visitor would be most interested in? This competition will be set up using the www.Kaggle.com web site, making data science a sport.
CHALLENGE CHARACTERISTICS:
Two years of mobile behavior, 67 million clicks, 27 million searches, 8 million users, 1 million products
You can enter one or both competitions:
- Cloud computing sized problem
- PC sized problem (a sample of the cloud sized problem)
ANALYSIS APPROACHES:
You may find useful material in the chapter 16 of the
2012 book “Scaling Up Machine Learning” or
GraphLab SVD, Vowpal Wabbit, Mahout, MADlib
http://en.wikipedia.org/wiki/Collaborative_filtering#Model-based
KICKOFF MEETING:
Saturday, August 18th, 9am
Talks in the morning about the problem, HP and Microsoft Cloud infrastructure
BigDataR Linux will provide a 2 hour course on GraphLab, Vowpal Wrabbit, MADlib
Sponsors are providing initial cloud compute time at HP and Microsoft!
See also http://aws.amazon.com/ec2/pricing/ (~ $1 / hr, High memory Linux, N CA)
Lunch provided by Microsoft (Thank You)
Cloud Center: 222 Caspian Dr, Sunnyvale, CA www.SVcloudCenter.com
Free Registration for Kickoff (helps us estimate food): http://sfbayacm.ticketleap.com/DM-Hackathon-2012-10
COMPETITION CLOSES: Saturday, 10/6/2012, 6pm PST
AWARDS GIVEN:
At the Data Mining Camp, Saturday, October 13 in San Jose
See also the ACM Training Class on Big Data, Sunday Oct 14 in San Jose
Prizes given: $2,000+
SPONSORS:
![]() |
HP has donated cloud computing time. |
![]() |
Best Buy has donated prize money. |
![]() |
Microsoft has donated lunch for attendees and computing support. |
![]() |
Amazon Web Services has donated compute time. |
HACKATHON KICKOFF MEETING TALKS, NOTES AND LINKS:
* Hackathon Kickoff by Greg Makowski - SF Bay ACM Chair
* Agenda of Upcoming Events and the Day by Tricia Hoffman - SF Bay ACM Chair of Data Mining
* Recommender Systems by Sarabjeet Chugh - Director at SunGard
* MetaZeta Clusters by Paul Baclace - Hadoop Consulting
* Big DataR and Best Buy Data Set by Nicholas Kolegraff - Data Scientist at Accenture
Setting up an Amazon Web Services Account Amazon Web Services
Video: BigDataR Linux + Data Science + Graphlab, up and running on AWS
* Azure Cloud by Brad Sarsfield - Software Engineer at Microsoft
COMPETITION LINKS (to get data, discuss, share, join teams, submit your results to the competition):
Kaggle's website for hosting competitions
1) Data Mining Hackathon on Big Data
2) Data Mining Hackathon on Smaller Data (start here)
3) Data Visualization competition (may want to work with other teams)
Results
Big Data
|
Kaggle User Name |
Actual Name |
Country |
Place |
|---|---|---|---|
|
LR |
Guocong Song |
San Jose, USA |
|
|
RapStar |
Kingsfield |
Nanjing, China |
|
|
Dragon |
Yan Gu |
China |
|
|
Phoneix |
Kevin Gu |
Shanghai, China |
3rd place |
Small Data
|
Kaggle User Name |
Actual Name |
Country |
Place |
|---|---|---|---|
|
Green Avenger |
David Thomas |
Dayton, Ohio USA |
|
|
vdaniloff |
Vladimir Danilov |
St. Petersburg, Russia |
|
|
CF1 |
Yasser Tabandeh |
Iran |
3rd place |
Visualization Contest
What and Where - Ashish Bansal
BestBuy Query Cloud - Anand Prasanna
Event page provided by ACM







