Personal tools

ML Algorithms and Models

Berlin_kyline_TV_Tower_River_Spree_092820A
[Berlin Skyline TV Tower River Spree]

 

- Overview

Machine learning (ML) involves the use of ML algorithms and models.

ML algorithms are processes implemented in code and run on data. The ML model is output by the algorithm and consists of model data and prediction algorithm. 

ML algorithms provide a type of automated programming in which ML models represent programs.

Simply put, a ML algorithm is like a recipe that allows computers to learn and predict based on data. Rather than explicitly telling the computer what to do, we feed it large amounts of data and let it discover patterns, relationships, and insights on its own.

 

- Machine Learning Algorithms

Machine learning (ML) algorithms are the brains behind any model, allowing machines to learn, making them smarter. The way these algorithms work is that they are fed an initial batch of data and, over time, additional data is fed as the algorithm improves its accuracy. 

This process of regularly exposing algorithms to new data and experiences increases the overall efficiency of the machine.

Here are some machine learning algorithms:

  • Reinforcement learning: Helps software agents and machines automatically evaluate the best behavior in a given environment.
  • Supervised learning: Provides the model with labeled data to train on. The model learns how to make correct predictions.
  • Semi-supervised learning: Uses a combination of labeled and unlabeled data sets during the training period.
  • Unsupervised learning: Learns the underlying structure of a dataset without being provided a target variable. 
  • Deep learning: Uses different layers of non-linear processing units for feature extraction.
  • Nearest neighbor: Classifies data points based on their similarity to other data points.
  • Gradient boosting: Relies on the idea that the best possible next model, when combined with previous models, minimizes the overall prediction error.
  • Apriori: Generates association rules for a given set of data.


When choosing a machine learning algorithm, you can consider:

  • Your project goal
  • Your data's size, processing, and annotation requirements
  • The speed and training time
  • Your data's linearity
  • The number of features and parameters

 

- Machine Learning Models

A ML model is a program that uses computational methods to make predictions or find patterns in a dataset. ML models are mathematical representations of the output of the training process. 

ML models are built using supervised machine learning algorithms that use labeled, unlabeled, or mixed data. They are similar to mathematical functions, taking input data, making a prediction, and serving a response. 

ML models are different from rule-based programs because they don't need to be explicitly coded and can change over time as new data enters the system.

Here are some steps to building a ML model:
  • Contextualize machine learning in your organization
  • Explore the data and select the type of algorithm
  • Prepare and clean the dataset
  • Split the prepared dataset and perform cross validation
  • Perform machine learning optimization
  • Deploy the model

 

- Model Training

Model training is a stage in the data science development lifecycle. It's the process of running a machine learning algorithm on a dataset, and then optimizing the algorithm to find certain patterns or outputs. 

Model training involves learning good values for all the weights and bias from labeled examples. The resulting function with rules and data structures is called the trained machine learning model. 

The process of training ML models can be divided into four steps:

  • Data set split for training and evaluation
  • Algorithm selection
  • Hyperparameter tuning
  • Model training

 

The model's performance during training will eventually determine how well it will work when it is eventually put into an application for the end-users. 

Before training your model, you can:

  • Identify the problem and candidate algorithms.
  • Identify data required to train the algorithms.
  • Collect initial data.
  • Identify its quality and suitability for the task.
  • Plan what is needed to make the dataset suitable for the project.

 

- Top ML Algorithms

Here are some machine learning algorithms:

  • K-means clustering: A clustering algorithm that groups similar data points into clusters. The number of groups is called K.
  • Reinforcement learning: A machine learning algorithm where a machine learns ideal behavior to maximize its performance.
  • Supervised learning: A machine learning algorithm that uses a known dataset to make predictions.
  • K-Nearest Neighbor (KNN): A machine learning algorithm that solves classification problems by assigning a new data point to a category.
  • Logistic regression: A machine learning classification algorithm that predicts the probability of certain classes based on dependent variables.
  • Support Vector Machines (SVMs): A supervised learning algorithm that is used for classification, regression, and outlier detection tasks.
  • Naive Bayes: A machine learning algorithm based on the idea of using Bayes' theorem to make predictions.
  • Random forest: A supervised learning algorithm that is used for classification and regression.
  • Decision tree: A machine learning algorithm for classification and regression problems.

 

[More to come ...]


Document Actions