What is Machine Learning
Machine Learning is a way for computers to learn patterns from data and make predictions or decisions without being explicitly programmed.
Examples
• Email → Spam or Not Spam
• Past house data → Predict house price
• User activity → Recommend videos
Where Do We Use It?
• Recommendation systems (Netflix, YouTube)
• Fraud detection
• Image & speech recognition
• Forecasting (weather, sales)Types of Machine Learning
Machine learning systems can learn in different ways, depending on the type of data and feedback available.
• Supervised Learning
• Unsupervised Learning
• Reinforcement Learning
• Semi-supervised Learning

Supervised Learning
Supervised learning is a type of machine learning where:
• Training data includes correct answers (labels)
• The model learns from past examples to predict answers for new data
How it works
• Input (X): data given to the model
• Output (Y): correct answer
• Train by comparing predictions with known answers
Examples
• Email → Spam / Not spam
• House data → House price
• Student data → Pass / Fail
Regression vs Classification
In supervised learning, we usually solve one of these two problems:

Regression
• Predicts a number
• Output is a continuous value
Examples:
• House features → House price
• Height → Expected weight / numeric value
Classification
• Predicts a category• Output is a discrete label
Examples:
• Transaction → Fraud or Not Fraud
• Email → Spam or Not SpamRegression
Regression is used when we want to predict a number.
Example: Height vs Weight
• Input (Feature): Height
• Output (Label): Weight
Given a person’s height, the model predicts their weight.

Classification
Classification is used when we want to predict a category.
Example: Fraud Detection
• Input (Features): transaction details
◦ amount, location, time, device
• Output (Label): Fraud or Not Fraud
How to Think About It
• The model learns decision boundaries
• Instead of a line predicting a number, it separates data into groups
Features (X) and Labels (Y)
In supervised learning, every example has:
• Features (X): the information we give to the model
• Label (Y): the correct answer we want to predict
Example
• Features (X):
◦ Size of the house
◦ Location
◦ Number of rooms
• Label (Y):
◦ House price
Why Features Matter
• Models learn only from the features you provide
• Better features → better predictionsFeature Scaling
Feature scaling is about bringing numerical features to a similar range so models can learn fairly.
Why Scaling Matters
• Features can have very different ranges
• Large numbers can dominate small numbers
• Scaling helps models learn patterns more effectively
Normalization
We scale values to a 0–1 range:
Normalised value = (value − min) / (max − min)
After normalisation:
• 1,000 → 0.0
• 5,000 → 0.44
• 10,000 → 1.0
|
Employee |
Salary |
Normalized |
|
A |
1,000 |
0 |
|
B |
5,000 |
0.44 |
|
C |
10,000 |
1.0 |
Feature Encoding
Feature encoding is about converting non-numerical data into numbers so models can use it.
Why Encoding Is Needed
• Machine learning models work with numbers only
• Text and categories must be represented numerically
Label Encoding
|
City |
value |
|
Colombo |
0 |
|
Kandy |
1 |
|
Galle |
2 |
One-Hot Encoding
|
Colombo |
Kandy |
Galle |
|
1 |
0 |
0 |
|
0 |
1 |
0 |
|
0 |
0 |
1 |
Unsupervised Learning
Unsupervised learning is used when:
• Data has no labels
• We want to discover patterns in the data
Common Goals
• Group similar data points together
• Understand the structure of the data
Example: Customer Segmentation
• Input: customer behaviour data
• Output: groups of similar customers
Clustering
Clustering groups similar data points together.
How It Works
• Each data point has features
• Similar points are placed in the same group
Use Cases
• Customer groups
• Document grouping
• Image grouping
Anomaly Detection
Identifies data points that behave differently from the rest.
How It Works
• Learn what “normal” data looks like
• Flag points that significantly deviate from normal patterns
Use Cases
•Fraud detection (unusual transactions)
•System monitoring
•Security alerts
Reinforcement Learning
Reinforcement learning is about learning what to do through experience.

• An agent takes actions in an environment
• Actions lead to outcomes
• Good outcomes are rewarded
• Bad outcomes are penalised
Goals
• Balance exploration (trying new actions) and exploitation (using what works)
• Maximize long-term reward while balancing exploration vs exploitation.
Examples
• Learning to play chess by playing games
• Learning driving strategies in simulation
• Training a system to make better decisions over time (traffic lights or recommendations)
Time Series Forecasting
Time series problems involve:
• Data collected over time
• Predicting future values from past data
Examples
• Weather forecasting
• Stock prices
• Sales forecasting
Same supervised learning idea, but time order matters.
Deep Learning
Deep learning is a subset of machine learning that uses neural networks.
Why Deep Learning?
• Traditional ML struggles with raw data
• Images, text, and audio are very complex
What It’s Used For
• Image recognition
• Speech recognition
• Large Language Models (LLMs)
Key Takeaways
• Machine learning learns patterns from data
• Supervised learning is the most common in the industry
• Data and features often matter as much as the algorithm
• Different learning types solve different problems
• Focus on the problem + data first, then choose the model
Author: Tharindu Perera is a Tech Lead at CMS with a passion for software architecture, machine learning, and AI.
