Monday, April 24, 2017 at 6:00:00 PM Pinsight Media 1100 Main St #1500, Kansas City, MO

Presenters:


Yotabites Consulting - 

Atul Khachar & Yeshwanth Jagini

Agenda:

To make the most of Big Data, and to reveal the hidden stories, we need to analyze the data.

In this meetup we are going to walk you through tools and framework to boost data scientists stack

Sparklyr - An R interface to run R code in Spark

Rstudio  - Open Source and Enterprise Ready Professional Software for R

In Specific we are going to cover:

1) A little intro about HDFS and SPARK

2) What is Sparklyr?

3) Difference between SparklyR/SparkR/Sparkling water

4) A little intro on RStudio

5) Deep dive into sparklyr with a use case

    -> Overview of environment and 

    -> Install and Setup

    -> Explore sparklyR package

    -> Reading and Writing Data

    -> Exploring dplyr package support for sparklyr

    ->  Demo on analysis of diabetes dataset & build some models

6) Conclusion

If you arrive after 6pm the elevators require a badge. Please sign in at the guard station and they will let you up. Click here for event

0 Response to "April 24: Data Science KC - Creating sparks with Rstudio - Analysis of diabetes dataset"

Post a Comment

Group Tools

Random Prize Winner
Use this tool to generate random numbers for prize drawings.




Follow this twitter list of the twitter accounts for the user groups. Ask for your group to be added to this list: twitter list
Subscribe to the Kansas City User Group Newspaper at Paper.li

Blog Archive

Followers