Big Data Spain - Nov 17 2016 - Madrid - Continuously Deploy Spark ML and TensorFlow AI Models: From...
TRANSCRIPT
BIG DATA SPAIN 2016
Continuously Deploy ML and AI Models: From Notebook to Microservice
Thank You, Madrid!
Chris Fregly, Research Scientist @
WHO AM I?
Chris Fregly
Research Scientist @ PipelineIO (Formerly Netflix and Databricks)
http://pipeline.io
WHAT IS PIPELINE.IO?
Extending Your ML Pipelines into Production
100% Open Source!
http://pipeline.io
BRAINSTORMING AND VALIDATING
• Major Gaming Company
• Large Ride Sharing Service
• Popular Q & A Site
• Online Clothing Retailer
• Dominant Video Streaming
PIPELINE.IO FOCUS
• Model Deploying and Testing
• Model Scaling and Serving
• Online Model Training
• Native Code Generation
ONLINE MODEL TRAINING
• Continuous, Incremental, and Partial Training
• Kafka + Spark Streaming + Spark ML
• Real-time, Dynamic Recommendations
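The online training idea on this slide (continuous, incremental, partial updates on streamed data) can be sketched in plain Python. This is a minimal illustration under stated assumptions, not PipelineIO's implementation: the `OnlineLinearModel` class, its `partial_fit` method, and the `simulated_stream` generator are hypothetical names, and an in-memory generator stands in for Kafka and Spark Streaming.

```python
from typing import Iterable, Iterator, List, Tuple

Example = Tuple[List[float], float]  # (features, label)


class OnlineLinearModel:
    """Linear regressor updated one mini-batch at a time with plain SGD."""

    def __init__(self, n_features: int, learning_rate: float = 0.1) -> None:
        self.weights = [0.0] * n_features
        self.bias = 0.0
        self.learning_rate = learning_rate

    def predict(self, features: List[float]) -> float:
        return self.bias + sum(w * x for w, x in zip(self.weights, features))

    def partial_fit(self, batch: Iterable[Example]) -> None:
        """Apply one SGD step per example; earlier batches are never revisited."""
        for features, label in batch:
            error = self.predict(features) - label
            self.weights = [
                w - self.learning_rate * error * x
                for w, x in zip(self.weights, features)
            ]
            self.bias -= self.learning_rate * error


def simulated_stream(n_batches: int, batch_size: int) -> Iterator[List[Example]]:
    """Hypothetical stand-in for a Kafka topic: mini-batches drawn from y = 2x + 1."""
    counter = 0
    for _ in range(n_batches):
        batch = []
        for _ in range(batch_size):
            value = (counter % 10) / 10.0
            batch.append(([value], 2.0 * value + 1.0))
            counter += 1
        yield batch


model = OnlineLinearModel(n_features=1)
for mini_batch in simulated_stream(n_batches=500, batch_size=10):
    # The model stays usable for serving between updates.
    model.partial_fit(mini_batch)
```

The point of the design is that each mini-batch adjusts the existing weights instead of triggering a full retrain, so a served model can follow new data as it arrives.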
PIPELINE.IO FOCUS FOR 2017
• Performance, Performance, Performance
• Native Code Generation: CPU + GPU
• More Global Contributors!
WE’RE HIRING!!
• Kafka, Spark ML, and TensorFlow Contributors
• Systems Engineers
• GPU/CUDA Engineers
• C++, Java, Scala, Python
WE ONLY HIRE NICE PEOPLE!!