Wednesday, February 15, 2017 at 5:00:00 PM Live Webinar, Kansas City, KS

Structured Streaming is a new scalable and fault-tolerant stream processing engine built on the Spark SQL engine. Structured Streaming allows you to express your streaming computation the same way you would express a computation on static data. This has two benefits. The first is code reuse as essentially the same queries be run on batch, interactive or streaming data. Second, it simplifies streaming application development as you can operate on streams of data just like you can on static data using DataFrames. Structured Streaming abstracts away the complexity of streaming analytics allowing you to perform streaming analytics without having to reason about streaming. The Spark SQL engine takes care of running Structured Streaming queries and incrementally and continuously updating the result as streaming data continues to arrive. With Spark Structured Streaming, you can express streaming aggregations, event-time windows, as well as join streaming data to static data. In this session, we'll walk through the basics of Structured Streaming, its programming model and APIs. The concepts will be illustrated using code examples. Then, we'll walk through a demo of analyzing both static and streaming sensor data to show how the same queries can be used on each, thereby simplifying streaming analytics application development, and how static and streaming data can be leveraged together.

Click here for event

0 Response to "February 15: Open Source Analytics - Kansas city, KS - Introduction to Spark Structured Streaming"

Post a Comment

Group Tools

Random Prize Winner
Use this tool to generate random numbers for prize drawings.

Follow this twitter list of the twitter accounts for the user groups. Ask for your group to be added to this list: twitter list
Subscribe to the Kansas City User Group Newspaper at

Blog Archive