Monday, April 24, 2017 at 6:00 PM


Yotabites Consulting - Atul Khachar & Yeshwanth Jagini


To make the most of Big Data and reveal the stories hidden in it, we need to analyze the data.

In this meetup we will walk you through tools and frameworks that boost a data scientist's stack:

sparklyr - an R interface for running R code on Spark

RStudio - open source and enterprise-ready professional software for R

Specifically, we are going to cover:

1) A brief intro to HDFS and Spark

2) What is sparklyr?

3) Differences between sparklyr, SparkR, and Sparkling Water

4) A brief intro to RStudio

5) Deep dive into sparklyr with a use case

    -> Overview of environment and 

    -> Install and Setup

    -> Exploring the sparklyr package

    -> Reading and writing data

    -> Exploring dplyr support in sparklyr

    -> Demo: analyzing a diabetes dataset and building some models

6) Conclusion


Event: "April 24: Data Science KC - Creating sparks with RStudio - Analysis of diabetes dataset"
