Tech workshop: “Spark MLlib: Past, Present, and Future”

By 湾区同学技术沙龙（www.tech-meetup.com）

Date and time

Saturday, October 3, 2015 · 1:30 - 4pm PDT

Location

HanHai Investment

97 E Brokaw Rd, Ste 210 San Jose, CA 95112

Refund Policy

Contact the organizer to request a refund.

Description

Tech workshop: “Spark MLlib: Past, Present, and Future”

Links:

Registration link: http://tech-meetup-10-03-2015.eventbrite.com
Event link: http://www.tech-meetup.com/events/10-03-2015
Join tech-meetup community:

LinkedIn group: https://www.linkedin.com/grp/members?gid=8362423
Google group：http://www.tech-meetup.com/signup
微信群：http://www.tech-meetup.com/wechat

Time: 1:30PM ~ 4:00PM, 10/03/2015, Saturday

Location: 97 E Brokaw Rd, Ste 210, San Jose, CA 95112

Agenda:

1:30pm - 2:00pm: Reception and social time

2:00pm - 3:30pm: Talk and QA

3:30pm - 4:00pm: offline networking

Tech Talks Abstract:

Apache Spark provides primitives for in-memory cluster computing, which is well suited for large-scale machine learning purposes. MLlib is a standard component in Spark providing machine learning primitives, initially created and contributed to Spark by UC Berkeley. With 50+ companies and 180+ individual developers contributing to MLlib, it is one of the most active open source projects for machine learning. MLlib’s goal is to make practical machine learning scalable and easy, and the community has devoted lot of time and effort towards this goal. In this talk, we present a brief history of MLlib, summarize new features in Spark 1.5, and discuss the roadmap. We will show the expansion of MLlib’s feature set, the evolution of MLlib’s pipeline API, the elevation of MLlib’s performance, as well as the integration with other Spark components. We will also provide entry points for users and developers to get started with Spark MLlib.

Speaker’ bio:

Xiangrui Meng is an Apache Spark PMC member and a software engineer at Databricks. His main interests center around developing and implementing scalable algorithms for scientific applications. He has been actively involved in the development and maintenance of Spark MLlib since he joined Databricks. Before Databricks, he worked as an applied research engineer at LinkedIn, where he was the main developer of an offline machine learning framework in Hadoop MapReduce. His Ph.D. work at Stanford is on randomized algorithms for large-scale linear regression problems.

Ticket Info:

In the past events, many persons registered but didn’t present. It is hard for us to get the accurate participant number before each event, and it wasted a lot of opportunities for others who really wanted to attend. So, from this event, we introduce the refundable tickets: full refund if you present, but non-refundable if absent. The first 150 tickets are still “backward-compatible”: totally free.

主办

湾区同学技术沙龙（www.tech-meetup.com）

协办

南京大学硅谷校友会

瀚海硅谷科技园

硅谷清华联网

中国科技大学校友会创业俱乐部

浙江大学校友会海纳创新创业俱乐部

北京大学北加州校友会

武汉大学北加州校友会

东南大学硅谷校友会

吉林大学硅谷校友会

复旦大学北加州校友会

华人事业互助会

中美创新协会(CHAIN)

Organized by

湾区同学技术沙龙（www.tech-meetup.com）

湾区数年，来去间尽是同窗故旧。闲侃中发现，多年来，不少人的志趣依然相投。我们几位朋友合议下，于是有了这么个技术沙龙。我们定位于技术和业界方面的讨论，同时也提供一个机会大家碰碰面，聊聊天，互通有无。至于告知我们，我们可以商讨提供讲座的机会；想要讨论某些方面内容的，请告诉我们，我们可以寻找相应的专家提供。

Sales Ended