UPDATE: Due to the coronavirus pandemic, we are canceling this meeting. We intend to reschedule it for sometime during the summer (July timeframe).
Our spring event will be hosted at MISO Energy in Carmel. If you are attending the event, please register by clicking the "Register Now" button below. Your registration ensures an accurate head count for meals and snacks.
Thank you. We hope to see you at the meeting!
9.15 - 9.30 AM - Welcome and announcements
9.30 - 10.30 AM - Data Governance essentials
- Chris Miller, Data Governance and Data Quality Leader @ OneAmerica
In this session, Chris will describe the essential functions of a Data Governance practice and walk through how to deploy a program based on your business's needs.
10.30 - 11.30 AM - Data Engineering with StreamSets DataOps Platform
- Intro to StreamSets - Dan Matic, Regional Sales Director @ StreamSets
- Primer on Transformer and Demo - Adam Bracey, Solutions Engineer @ StreamSets
- Transformer Extensibility and Demo - Dash Desai, Platform Evangelist @ StreamSets
StreamSets Transformer is an execution engine within the StreamSets DataOps platform that allows users to create data processing pipelines that execute on Spark. Using a simple-to-use drag-and-drop UI, users can create pipelines for performing ETL, stream processing, and machine learning operations. It allows everyone, not just savvy Spark developers but also Data Analysts, Data Scientists, and legacy ETL developers, to fully utilize the power of Apache Spark with minimal operational and configuration overhead and without requiring a deep technical understanding of the platform.
11.30 AM - 1 PM - Networking lunch, courtesy of StreamSets
1 - 3 PM - Big Data and Machine Learning Update 2020
Last year we discussed how Big Data and Machine Learning have moved far beyond research and ad hoc analysis to become integral parts of production systems. We talked about the complexities of building predictive models, managing the data used to produce them, aspects associated with “allowing the systems to learn”, and gauging their performance.
This year, after an overview of what’s new in Big Data and Machine Learning, we are going to take a deep dive into Big Data database technologies. We will discuss how these technologies compare with object storage, and how they fit into scalable on-premises, cloud-based, or hybrid object stores. We will talk about familiar technologies such as Hive and Impala, how they have evolved over time, and some practical use cases for them. We will discuss alternatives to managing Big Data with SQL-like databases, such as Spark’s datasets and data frames. We will also look at technologies that are growing in popularity, such as in-memory databases (for example, Arrow) and graph databases.
We hope that this session will give DevOps engineers and Data Scientists the information they need to evaluate alternative approaches to Big Data projects.
3 PM - Wrap-Up and door prize drawing
Paid members of DAMA Indiana are admitted to events at no cost. The non-member cost is $25. If you are interested in becoming a member of DAMA Indiana, please speak with a board member at the event. Corporate Membership is $195 annually. Individual Membership is $75 annually. Students are admitted free.
CARMEL, IN 46032-3826
Price Per Seat: $25.00