12 Jan 22:42

SynapseML v0.9.5

Highlights


Geospatial Intelligence	Multivariate Anomaly Detection	Responsible AI at Scale	Text To Speech	Healthcare Analytics
Large-scale map and geocoding operations	Build custom time series anomaly detection systems	Distributed Conditional Expectation and Partial Dependence Analysis	East-to-use Neural Text to Speech for large datasets	Quickly understand entities and relationships in corpora of medical text.

New Features

Added support for distributed geospatial queries backed by the Azure Maps API
Added the geospatial usage overview (#1339)
Explore how to use the geospatial intelligence services to analyze flood risks. (#1339)
Added the AddressGeocoder transformer to map informal addresses to standardized adresses with latitude and longitude (#1294)
Added the ReverseGeocoder transformer to map latitude and longitude measurements to standardized addresses. (#1339)
Added the CheckPointInPolygon, to detect if latitude and longitude queries lie inside regions of interest (#1339)

Added the Healthcare Analytics Transformer for extracting medical information, entities, and relationships for text. [Example Usage] (#1329)
Added the FitMultivariateAnomaly estimator for training custom anomaly detection models on DataFrames of multivariate time series data (#1272)
Added example notebook for Multivariate Anomaly Detector
See how to train a custom Multivariate Anomaly detector in the Estimators reference docs (#1323)
Added simplified Text Analytics transformers that support auto-batching (#1329)
Added the TextToSpeech Transformer for transforming Dataframes of text to audio files with neural voice synthesis (#1320)
Added the TextAnalyze transformer to support executing multiple text analytics workloads within a single API call (#1267, #1312)

Added Individual Conditional Expectation explanations and Partial Dependence Plots with the ICETransformer. This tool gives detailed explanations of how features in opaque-box models affect the model prediction. (#1284)
Learn about how to use the ICETransformer through an example with the Adult Census dataset

Improved LightGBM training performance 4x-10x by setting num_threads to be cores-1 (#1282)
Added the predict_disable_shape_check in LightGBM (#1273)
Reduced temporary file bloat by creating the LightGBM native temp directory lazily (#1326)
Added logging for number of columns and rows when creating datasets, set useSingleDatasetMode=True by default (#1222)

Allowed FlattenBatch to propagate non-array values (#1286)
Fixed flaky tests (#1342)
Fixed website bugs and migrated docSearch (#1331)
Fixed issue where IsolationForestModel does not properly exchange params with the inner model (#1330)
Corrected the objective param when using fobj (#1292)
Fixed issue where broadcasted sum in breeze 1.0 breaks in Spark 3.2.0 (#1299)
Hotfixes for R test runners (#1283)
fix installation instruction (#1268)
Removing broadcast hint (#1255)
fix install instructions (#1259)

We are excited to highlight the contributions of the following SynapseML contributors:


Serena Ruan	Ilya Matiach	Sudhindra Kovalam
Serena is an engineer on the Azure Synapse team in Beijing. In this release, Serena has continued her unbelievable speed of contributions with support for Multivariate Anomaly Detection, MLFlow, and installation from Maven Central. These contributions are just a few of the many projects Serena has contributed since she joined just a few months ago!	Ilya is a prolific engineer on the Azure Machine Learning Boston team working on responsible AI. Ilya contributed LightGBM on Spark and worked tirelessly to improve and support this feature. Ilya has been an active contributor to the SynapseML project for 5 years and has built many of the tools in the library.	Sudhindra is an engineer on the Microsoft Maps team and has contributed intelligent geospatial APIs to SynapseML v0.9.5. Sudhindra developed new ways to automate generation of Spa...

nhymxu, martin0258, and 19 other contributors

Assets 2

16 Nov 05:19

SynapseML v0.9.4


General Availability on Synapse	ONNX on Spark	Responsible AI	Form Recognition and Translation	Reinforcement Learning
We are ready to help you productionalize on Azure Synapse Analytics	Distributed and hardware accelerated model inference on Spark	Understand opaque-box models, measure dataset biases, Explainable Boosting Machines	Parse PDFs and translate dataframes between over 100 languages	Contextual Bandit Reinforcement Learning with Vowpal Wabbit