myVertica  

Vertica Blog

Machine Learning

What’s New in Vertica 8.1.1: Machine Learning

This blog post was authored by Soniya Shah. Vertica 8.1.1 continues with the fast-paced development for machine learning. In this release, we introduce the highly-requested random forest algorithm. We added support for SVM to include SVM for regression, in addition to the existing SVM for classification algorithm. L2 regularization was added to both the linear […]

Machine Learning Mondays: How Vertica Implements Efficient and Scalable Machine Learning

This blog post was authored by Vincent Xu. As of Vertica 8.1, Vertica has introduced a set of popular machine learning algorithms, including Linear Regression, Logistic Regression, Kmeans, Naïve Bayes, and SVM. Based on our recent benchmarks, they run faster than MLlib on Apache Spark. The following chart shows the performance difference between Vertica 8.1.0 […]

Machine Learning Mondays: Data Preparation for Machine Learning in Vertica

This blog post was authored by Vincent Xu. This post is part of our Machine Learning Mondays series. Stay tuned for more! Introduction Machine learning (ML) is an iterative process. From understanding data, preparing data, building models, testing models to deploying models, every step of the way requires careful examination and manipulation of the data. […]

What’s New in Vertica 8.1: Machine Learning

This blog post was authored by Soniya Shah. Overall, you will notice that Machine Learning for Predictive Analytics, introduced in Vertica 7.2.2, is more accessible to use in Vertica 8.1, with the addition of several important functions. There are improvements to model management with access control ability to save and re-apply normalization parameters, missing value […]

Vertica Machine Learning Series: Logistic Regression

This blog post is based on a white paper authored by Maurizio Felici. What is Logistic Regression? Logistic regression is a popular machine learning algorithm used for binary classification. Logistic regression labels a sample with one of two possible classes, given a set of predictors in the sample. Optionally, the output can be the probability […]

Vertica Machine Learning Series: k-means

The content of this blog is based on a white paper that was authored by Maurizio Felici. What is k-means Clustering? K-means clustering is an unsupervised learning algorithm that clusters data into groups based on their similarity. Using k-means, you can find k clusters of data, represented by centroids. As the user, you select the […]

Machine Learning Series: Linear Regression

The content of this blog is based on a white paper that was authored by Maurizio Felici. This blog post is just one in a series of blog posts about the machine learning algorithms in Vertica. Stay tuned for more! What is Linear Regression? Let’s start with the basics. Linear regression is one of the […]

What’s New in Vertica 8.0: Machine Learning Enhancements

Overall, you will notice that Machine Learning for Predictive Analytics, introduced in Vertica 7.2.2, is more accessible and intuitive to use in Vertica 8.0. The API has been streamlined, and it’s easier to get up and running. Also, there are additional options for data preparation.

Watch Machine Learning for Predictive Analytics in Action

Watch this video to learn more about the Vertica Machine Learning for Predictive Analytics features new in 7.2

Learn More From Your Data with Machine Learning Algorithms

New in Vertica 7.2.2 is the Machine Learning for Predictive Analytics package. This analytics package allows you to use built-in machine learning algorithms on data in your Vertica database. Machine learning algorithms are extremely valuable in data analytics because, as their name suggests, they can learn from your data and provide information about deductive and […]