General Introduction to Machine Learning.

Faheem Khan
5 min readJun 1, 2023

--

Types of ML Algorithms and there categories.

→ General Introduction to Machine Learning.

Types of Machine Learning Algorithms:

  1. Supervised Machine Learning.
  2. Unsupervised Machine Learning.
  3. Semi-Supervised Machine Learning.
  4. Reinforcement Machine Learning.

1. Supervised Machine Learning:

Supervised machine learning is a subfield of artificial intelligence (AI) and machine learning (ML) that involves training a model on labeled data to make predictions or decisions. In supervised learning, we have a dataset that consists of input features and corresponding output labels or target variables. The goal is to build a model that can learn the relationship between the input features and the output labels, allowing it to make accurate predictions on new, unseen data.

Supervised learning algorithms can be broadly categorized into two main types: regression and classification.

  1. Regression: Regression algorithms are used when the target variable is continuous or numerical. The goal is to predict a value based on the input features. Examples include predicting housing prices based on features like location, size, and number of rooms, or predicting a person’s income based on their education, experience, and age.
  2. Classification: Classification algorithms are used when the target variable is categorical or belongs to a discrete set of classes. The goal is to assign a class label to an input instance based on its features. Examples include classifying emails as spam or not spam based on their content, or classifying images of handwritten digits into their respective numerical classes.

2. Unsupervised Machine Learning:

Unsupervised machine learning is a subfield of artificial intelligence (AI) and machine learning (ML) that deals with analyzing and extracting insights from unlabeled data. Unlike supervised learning, unsupervised learning does not have explicit output labels or target variables associated with the input data. Instead, the goal is to discover patterns, structures, or relationships within the data itself.

There are several common approaches to unsupervised learning:

  1. Clustering: Clustering algorithms aim to group similar data points together based on their inherent characteristics or distances between them. The goal is to identify natural clusters or partitions within the data. Examples of clustering algorithms include k-means, hierarchical clustering, and DBSCAN.
  2. Dimensionality Reduction: Dimensionality reduction techniques are used to reduce the number of input features or variables while preserving the essential information in the data. This is particularly useful when dealing with high-dimensional data, as it can help visualize and understand the data, reduce computational complexity, and remove noise or irrelevant features. Techniques such as Principal Component Analysis (PCA) and t-SNE (t-distributed Stochastic Neighbor Embedding) are commonly used for dimensionality reduction.
  3. Anomaly Detection: Anomaly detection algorithms aim to identify rare or unusual data points that deviate significantly from the norm. These algorithms learn patterns or representations of normal data and flag instances that are considered anomalous or outliers. Anomaly detection is commonly used in fraud detection, network security, and system monitoring.
  4. Association Rule Mining: Association rule mining is concerned with discovering interesting relationships or associations among items or variables in a dataset. It involves finding frequent patterns or co-occurrences of items and extracting rules that capture these relationships. This technique is often used in market basket analysis, where associations between products in a shopping cart are studied to understand customer behavior and make recommendations.

3. Semi-Supervised Machine Learning:

Semi-supervised machine learning is a combination of supervised and unsupervised learning approaches. In this setting, we have a dataset that contains both labeled and unlabeled data. The labeled data consists of input features along with their corresponding output labels, while the unlabeled data only consists of input features without any associated labels.

There are different approaches to semi-supervised learning:

  1. Self-training: In self-training, an initial model is trained using the labeled data. This model is then used to make predictions on the unlabeled data, and the confident predictions are treated as pseudo-labeled data. The pseudo-labeled data is combined with the original labeled data, and the model is retrained on this augmented dataset. This process iteratively continues, refining the model’s predictions and updating the pseudo-labeled data.
  2. Co-training: Co-training is applicable when the input features can be divided into multiple views or perspectives, and each view provides complementary information. Initially, two or more separate models are trained using different subsets of the features. These models then make predictions on the unlabeled data, and their confident predictions are used to label the unlabeled instances. The newly labeled data is added to the original labeled data, and the models are retrained on the expanded dataset. This process iterates, with each model learning from the other models’ predictions.
  3. Generative models: Generative models, such as generative adversarial networks (GANs) or variational autoencoders (VAEs), can be used in semi-supervised learning. These models aim to learn the underlying data distribution and generate realistic samples from it. By incorporating both labeled and unlabeled data, the generative model can capture the data distribution more effectively, enabling better discrimination between classes.

4. Reinforcement Learning:

Reinforcement learning (RL) is a branch of machine learning that deals with how an agent can learn to make sequential decisions in an environment to maximize a cumulative reward signal. In reinforcement learning, the agent learns through a process of trial and error, taking actions in the environment and receiving feedback in the form of rewards or punishments.

The key components in reinforcement learning are as follows:

  1. Agent: The agent is the learner or decision-maker that interacts with the environment. It takes actions based on its observations and receives feedback in the form of rewards or penalties.
  2. Environment: The environment is the external system or world in which the agent operates. It provides the agent with observations, and in response to the agent’s actions, it produces new observations and rewards.
  3. State: The state represents the current situation or condition of the environment. It captures the relevant information that the agent needs to make decisions.
  4. Action: The action is a decision made by the agent in response to the observed state. The set of available actions depends on the specific problem and can be discrete or continuous.
  5. Reward: The reward is a numerical signal provided by the environment to indicate the desirability or quality of the agent’s action. The agent’s objective is to maximize the cumulative reward it receives over time.

Reinforcement learning algorithms can be categorized into two main types:

  1. Value-based methods: In value-based methods, the agent learns the value function, which estimates the expected cumulative reward starting from a particular state and following a specific policy. Q-learning and deep Q-networks (DQNs) are examples of value-based algorithms.
  2. Policy-based methods: In policy-based methods, the agent directly learns the policy function, which maps states to actions without explicitly estimating value functions. Policy gradient algorithms, such as REINFORCE and Proximal Policy Optimization (PPO), are examples of policy-based algorithms.

--

--

Faheem Khan
Faheem Khan

Written by Faheem Khan

AI/ML Engineer | Writing on algorithms, data, and AI’s transformative impact. Passionate about simplifying complexity and inspiring innovation. Let’s create!

No responses yet