Microsoft Updates New Machine Learning Platform for Apache Spark
This week Microsoft Announced that is has released version 0.16 of its new deep learning data science tool for Spark, Microsoft Machine Learning for Apache Spark, (MMLSpark) on Github.
MMLSpark requires Scala, Spark and Python, and works with Microsoft Cognitive Services and Azure Databricks. It was originally released two years ago, with the most recent version before this -- .015 -- coming in November.
New features and improvements in version 0.16 include support for Spark deep learning pipelines, a new "ranking train validation splitter," better integration with Azure Search, support for name entry recognition cognitive service on Spark (for analytical text extraction), improved boosting capabilities with the gradient boosting tool for tree-based algorithms LightGBM, as well as many other changes.
More information on MMLSpark can be found on the Microsoft product page here.
About the Author
Becky Nagel is the vice president of Web & Digital Strategy for 1105's Converge360 Group, where she oversees the front-end Web team and deals with all aspects of digital projects at the company, including launching and running the group's popular virtual summit and Coffee talk series . She an experienced tech journalist (20 years), and before her current position, was the editorial director of the group's sites. A few years ago she gave a talk at a leading technical publishers conference about how changes in Web browser technology would impact online advertising for publishers. Follow her on twitter @beckynagel.