An Introduction to R for Software Developers and Data Analysts

Date

Saturday, March 10, 2012 - 8:30am - 5:00pm

Venue

eBay Town Hall

2161 North 1st Street
San Jose, CA 95131
Speaker: Robert Kabacoff

Event Details

R has become one of the most popular languages for data analysis and graphics. This workshop will provide a practical introduction to this comprehensive platform.  Participants will learn to import data into R from a variety of sources; clean, recode, and restructure data; and apply R’s many functions for summarizing, modeling, and graphing data. Both basic and more advanced forms of data analysis and graphics will be covered. Additional topics include navigating R’s comprehensive help systems, practical advice for processing data, common programming mistakes to avoid, and useful functions for data mining.

Course Outline

I. Introduction – An introduction to R: R syntax and data structures; working interactively and in batch mode; alternative IDEs and GUIs; adding functionality through packages; common programming mistakes; getting unstuck – where to find answers to your questions

II. Data Management – Importing, cleaning, and reformatting data:  transforming and recoding variables; subsetting, merging, and aggregating data; control structures; user-written functions

III. Graphics – Taking advantage of R’s powerful graphics:  creating basic and advanced graphs; customizing and combining graphs; innovative methods for visualizing complex data

IV. Statistical Analysis and Data Mining – Using R for description, prediction, and classification: descriptive statistics and multi-way tables; ANOVA variants; regression (e.g., linear, logistic, Poisson); classification trees, cluster analysis, and other multivariate methods; dealing effectively with missing data; going further

Here is sample code that will be used in the course.

 About R

Attendees (as of 3/05): 121

Speaker Bio

Dr. Kabacoff is a seasoned researcher who has specialized in data analysis and graphics since 1986. As Vice President of Research for Management Research Group, he consults widely with academic, government, and corporate organizations throughout North America and Western Europe. Dr. Kabacoff has taught numerous graduate courses on topics in multivariate statistics and statistical programming and maintains the Quick-R website at http://www.statmethods.net. For more information on his book R in Action, see http://www.manning.com/kabacoff.

 

Event page provided by ACM