Skip to content

My Personal Blog

Machine Learning and beyond

Menu
  • Artificial Intelligence
    • GenAI, Agentic Workflows & Knowledge Base with Amazon Bedrock
    • Building RAG Machine Learning Template on AWS
    • Reinforcement Learning
    • Computer Vision – AWS Rekognition
    • AWS Sagemaker AI Studio User Journey
    • MLOps Orchestrator with AWS Sagemaker AI
  • MACHINE LEARNING
    • Python for Data Science: Unsupervised Learning Algorithm
    • Python for Data Science: Supervised Learning Algorithm
    • Python for Data Science: Machine Learning
    • Supervised Machine Learning: Student Performance analysis using supervised learning algorithm
    • Unsupervised Machine Learning: Clustering using K-Means Algorithm
    • Unsupervised Machine Learning: Image compression using K-Means Cluster Algorithm
    • Unsupervised Machine Learning: Image compression using K-Means Cluster Algorithm
  • adventures
    • snowdonia-wales
    • Santini600
  • TRIATHLON
    • 2019 Ironman 70.3 Asia Pacific Championship, Vietnam
    • Race Report: 2019 Ironman Western Sydney
    • 2017 Ironman 70.3 Bintan
    • 2017 Perth Marathon
    • 2016 Ironman 70.3 Western Sydney
  • About Me
  • Let’s Connect
Menu

Python for Data Science: Machine Learning

Posted on May 10, 2020August 14, 2025 by pluto gasanova


What is Machine Learning?

In essence, it is a type of Artificial Intelligence that extract patterns out of raw data by using an algorithm or method. The key focus of Machine Learning is to allow computer system to learn from experience without human intervention or being explicitly programmed.
Two categories of Machine Learning are: Supervised Learning and Unsupervised Learning.


1. Supervised Learning

It is one paradigm of Machine Learning where the algorithm is used to best estimate the mapping function ( f ) from an input set of data (X) in order to predict the output (Y) for that set of data.

Y = f (X)

The output (Y) are labelled and the Supervised Learning algorithm are trained using labelled data.


Supervised Learning can be grouped into:

          1.1. Classification
          1.2. Regression

By understanding whether the Machine Learning task is a classification or regression is key for selecting the right Supervised Learning algorithm to use.


1.1. Classification

Classification involves assigning the target feature a Label. The Classification algorithm is then used to estimate the mapping function (f) from an input set of data (X) in order to obtain discrete output (Y).
The target examples of binomial classification are: 1 or 0, Yes or No.
The target examples of multinomial classification are: Positive or Negative or Neutral.


Real life examples:

  • Email Spam Classification: Given a dataset of emails, the aim is to predict which incoming email is labelled as Spam or No Spam
  • Weather Prediction: Given a dataset of weather, the aim is to predict the rain will happen tomorrow at 9am is labelled as Yes or No
  • Credit Approval Classification: Given a dataset of thousands of credit applications which contain different attributes such as Annual Salary, Debt history, Age, credit rating etc. The aim is to predict which credit application falls under Approved or Not Approved or Need Review
  • Sentiment Analysis Classification: Given a dataset of users tweets on one issue on Twitters of several cities. The aim is to predict which city has Positive or Negative or Neutral sentiment.


1.2. Regression

Regression involves assigning the target feature a Label. The Regression algorithm is then used to estimate the mapping function (f) from an input set of data (X) in order to obtain continuous numeric value output (Y).
Real life examples:

  • Stock Market: Given a dataset of a share price of previous time, the aim is to predict the price of the share tomorrow in the form of price range between lowest price and highest price.
  • Weather Prediction: Given a dataset of rainfall intensity of previous time, the aim is to predict the amount of rain that will occur tomorrow. 
  • House Price: Given a dataset of housing price of previous time, the aim is to predict the price range of the house.
  • Press publishing: Given a dataset of number of pageviews of various article topics, the aim is to predict the range of  number of pageviews of the new article.


2. Unsupervised Learning

It is another paradigm of Machine Learning which looks at recognizing a pattern from the given dataset.
The output are not labelled and the Unsupervised Learning algorithm develops a learning through observation and finds all kind of patterns in data.

The Unsupervised Learning can be grouped into:

          2.1. Clustering
          2.2. Association

By understanding whether the Machine Learning task is a Clustering or Association is key for selecting the right Unsupervised Learning algorithm to use.


2.1. Clustering

Clustering is the Unsupervised Learning that involves the detection of potentially useful clusters in the data. The Clustering algorithm is then used to discover the inherent group as cluster in the data.
Real life examples:

  • Customer Grouping: Given a dataset of customer purchase records, the aim is to group the customers by purchasing behavior. 


2.2. Association

Association is the Unsupervised Learning that involves the detection of association rule in the data. The Association rule algorithm is then used to discover the rule that describes large portions of data.

Real life examples:

  • Product Association: Given a dataset of customer purchase records, the aim is to associate customers such that people who buy product A tends to buy product B

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Recent Posts

  • GenAI, Agentic Workflows & Knowledge Base with Amazon Bedrock
  • About Me
  • Let’s Connect
  • Hiking to Snowdon Summit – Spring 2025
  • Santini 600 : Cycling 600 km in 24 hours for a Good Cause

Archives

  • August 2025
  • June 2025
  • May 2025
  • March 2022
  • June 2020
  • May 2020
  • November 2019
  • June 2019
  • September 2017
  • July 2017
  • December 2016
© 2025 My Personal Blog | Powered by Superbs Personal Blog theme