Unveiling Machine Learning: How Machines Learn from Data

July 21, 2025 | ai | python | machine learn |

Machine Learning (ML) is a fascinating field where computers learn from data without being explicitly programmed. Instead of writing rigid rules for every possible scenario, we feed algorithms data, and they figure out patterns, make predictions, or take actions based on what they’ve “learned.” It’s like teaching a child by showing them examples rather than giving them a rulebook.

At its core, ML is about building models that can generalize from observed data to unseen data. This ability to learn and adapt is what makes machine learning so powerful and transformative across industries, from self-driving cars to personalized recommendations.

Let’s explore the main ways machines learn, using a relatable example: a simple recommendation system for an online streaming service (like Netflix or Spotify).

Forms of Learning in Machine Learning

The world of machine learning is broadly divided into a few key learning paradigms, each suited for different types of problems and data.

1. Supervised Learning: Learning from Labeled Examples

Supervised learning is the most common type of machine learning. Here, the algorithm learns from a labeled dataset, which means each piece of training data comes with the correct “answer” or outcome. The goal is for the model to learn a mapping from input features to output labels so it can predict labels for new, unseen data.

Analogy: Imagine you’re teaching a child to identify fruits. You show them pictures of apples and say “This is an apple.” Then you show them bananas and say “This is a banana.” After many examples, the child learns to identify new fruits on their own. The “labels” here are the fruit names.

Example: Recommending Movies Based on User Ratings (Classification/Regression)

Let’s say our streaming service wants to recommend movies that a user will like. We have historical data where users have rated movies (e.g., on a scale of 1 to 5 stars).

Classification: If we simplify and just want to predict if a user will “Like” (4-5 stars) or “Dislike” (1-3 stars) a movie.
Regression: If we want to predict the exact rating (1-5 stars) a user will give a movie.

We’ll focus on a classification scenario: predicting if a user will “like” a movie (binary outcome).

import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.metrics import accuracy_score

# Sample data: User features (e.g., genre preferences, watch history length) and whether they liked a movie (1=liked, 0=disliked)
# Let's say: Feature 1 = User's average rating for Action movies, Feature 2 = User's average rating for Comedy movies
movie_data = {
    'action_rating': [4.5, 2.0, 3.8, 4.0, 2.5, 4.2, 1.5, 3.0, 4.8, 2.2],
    'comedy_rating': [2.0, 4.0, 3.0, 1.8, 4.5, 2.5, 3.5, 4.2, 1.5, 3.8],
    'user_liked_movie': [1, 0, 1, 1, 0, 1, 0, 0, 1, 0] # 1 means liked, 0 means disliked
}
df_movies = pd.DataFrame(movie_data)

X = df_movies[['action_rating', 'comedy_rating']]
y = df_movies['user_liked_movie']

# Split data into training and testing sets
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)

# Create a K-Nearest Neighbors Classifier model
# This model classifies new data points based on the majority class of their 'k' nearest neighbors in the training data.
knn_model = KNeighborsClassifier(n_neighbors=3)

# Train the model
knn_model.fit(X_train, y_train)

# Make predictions on the test set
y_pred = knn_model.predict(X_test)

# Evaluate the model
accuracy = accuracy_score(y_test, y_pred)
print(f"Supervised Learning (KNN) Accuracy: {accuracy:.2f}")

# Predict for a new user: average 3.5 for Action, 2.0 for Comedy
new_user_features = pd.DataFrame([[3.5, 2.0]], columns=['action_rating', 'comedy_rating'])
prediction = knn_model.predict(new_user_features)
print(f"Prediction for new user (Action: 3.5, Comedy: 2.0): {'Will Like Movie' if prediction[0] == 1 else 'Will Dislike Movie'}")

Here, KNeighborsClassifier learns from labeled examples (user_liked_movie) to predict new ones.

2. Unsupervised Learning: Finding Patterns in Unlabeled Data

Unsupervised learning deals with unlabeled data. The algorithms try to find hidden structures, patterns, or relationships within the data on their own. There’s no “correct” answer to learn from; instead, the model explores the data’s inherent organization.

Analogy: Imagine giving a child a box of assorted toys and asking them to sort them into groups without telling them how. They might group them by color, size, type (cars, dolls), or material. They’re finding patterns on their own.

Example: Discovering User Segments (Clustering)

For our streaming service, we might have thousands of users, but we don’t know if there are natural groups among them. Unsupervised learning, specifically clustering, can help us discover “user segments” based on their watching habits, without pre-defining those segments. This can inform targeted marketing or content recommendations.

Let’s use the same action_rating and comedy_rating features, but this time, we don’t provide the user_liked_movie label.

from sklearn.cluster import KMeans
import matplotlib.pyplot as plt

# Using the same feature data as before, but without the 'user_liked_movie' label
X_unsupervised = df_movies[['action_rating', 'comedy_rating']]

# Create a K-Means Clustering model (let's say we want to find 3 user segments)
kmeans = KMeans(n_clusters=3, random_state=42, n_init=10) # n_init for modern KMeans

# Fit the model and get cluster assignments for each user
clusters = kmeans.fit_predict(X_unsupervised)
df_movies['user_segment'] = clusters # Add cluster assignments back to the DataFrame

print("\nUsers with assigned segments (unsupervised learning):")
print(df_movies)

# Visualize the clusters (optional)
plt.figure(figsize=(8, 6))
for i in range(3): # For each cluster
    segment_data = df_movies[df_movies['user_segment'] == i]
    plt.scatter(segment_data['action_rating'], segment_data['comedy_rating'], label=f'Segment {i}')
plt.xlabel('Average Action Movie Rating')
plt.ylabel('Average Comedy Movie Rating')
plt.title('User Segments Based on Genre Ratings')
plt.legend()
plt.grid(True)
plt.show()

KMeans identifies natural groupings of users based on their genre ratings without needing explicit labels.

3. Reinforcement Learning: Learning Through Trial and Error

Reinforcement learning (RL) is about an agent learning to make sequential decisions in an environment to maximize a cumulative reward. It’s like teaching a pet tricks: you reward them for desired behaviors, and they learn through trial and error what actions lead to positive outcomes.

Analogy: Imagine our streaming service has an AI agent that recommends the next movie to watch after a user finishes one. If the user watches the recommended movie and rates it highly, the agent gets a “reward.” If they close the app or rate it poorly, it’s a “penalty.” Over many trials, the agent learns a strategy (a “policy”) for making the best recommendations.

Example: Dynamic Movie Recommendation Agent

Reinforcement learning typically involves more complex simulations and environments than can be easily demonstrated with simple scikit-learn code. It often uses libraries like OpenAI Gym or Stable Baselines3.

Here’s a conceptual Python example demonstrating the idea of an RL agent learning:

# This is a conceptual example, as full RL environments are complex.
# We're simulating a very simple 'environment' for demonstration.

class StreamingEnvironment:
    def __init__(self):
        self.satisfaction = 0 # Cumulative reward
        self.recommendation_count = 0
        self.movie_database = {
            'Action': 0.7, # 70% chance of user liking an action movie if recommended
            'Comedy': 0.6,
            'Drama': 0.8
        }

    def recommend_movie(self, genre_choice):
        self.recommendation_count += 1
        # Simulate user's response based on genre choice
        if random.random() < self.movie_database.get(genre_choice, 0.5): # Default 0.5 if genre not found
            reward = 1 # User liked it
            print(f"Recommended {genre_choice}. User liked it! (+1 reward)")
        else:
            reward = -1 # User disliked it
            print(f"Recommended {genre_choice}. User disliked it! (-1 reward)")
        self.satisfaction += reward
        return reward

import random

# Simple RL "Agent" that tries to find the best genre to recommend
class SimpleRecommendationAgent:
    def __init__(self, genres):
        self.genres = genres
        self.genre_rewards = {genre: 0 for genre in genres} # Keep track of total reward for each genre
        self.genre_counts = {genre: 0 for genre in genres} # Keep track of how many times each genre was recommended

    def choose_action(self):
        # A simple strategy: choose genre that has historically given the best average reward
        if all(count == 0 for count in self.genre_counts.values()): # If no history, pick randomly
            return random.choice(self.genres)
        
        # Choose the genre with the highest average reward
        best_genre = None
        max_avg_reward = -float('inf')
        for genre in self.genres:
            if self.genre_counts[genre] > 0:
                avg_reward = self.genre_rewards[genre] / self.genre_counts[genre]
                if avg_reward > max_avg_reward:
                    max_avg_reward = avg_reward
                    best_genre = genre
            else: # Prioritize genres not yet explored, or explore randomly
                 return random.choice(self.genres) # Simple exploration
        return best_genre if best_genre else random.choice(self.genres) # Fallback

    def learn(self, genre_chosen, reward):
        self.genre_rewards[genre_chosen] += reward
        self.genre_counts[genre_chosen] += 1

print("\n--- Reinforcement Learning Simulation ---")
env = StreamingEnvironment()
agent = SimpleRecommendationAgent(list(env.movie_database.keys()))

num_recommendations = 10

for i in range(num_recommendations):
    chosen_genre = agent.choose_action()
    reward = env.recommend_movie(chosen_genre)
    agent.learn(chosen_genre, reward)

print(f"\nTotal user satisfaction after {num_recommendations} recommendations: {env.satisfaction}")
print("Agent's learned average rewards per genre:", {genre: (agent.genre_rewards[genre] / agent.genre_counts[genre] if agent.genre_counts[genre] > 0 else 0) for genre in agent.genres})

This simplified example illustrates an agent taking actions (recommending genres), receiving rewards, and adjusting its strategy over time.

4. Learning from Examples (Instance-Based Learning)

While not a separate “form” of learning in the same category as supervised/unsupervised/reinforcement, “Learning from Examples” often refers to instance-based learning or memory-based learning. These algorithms don’t create an explicit, generalized model during training. Instead, they simply store the training examples and generalize only when a new query comes in, by comparing it to the stored examples.

Analogy: Imagine a librarian who doesn’t memorize every book’s content but remembers where similar books are located. When you ask for a book on a new topic, they find the most similar books they’ve already cataloged to recommend new ones.

Example: K-Nearest Neighbors (KNN) in action

Our earlier Supervised Learning example with KNeighborsClassifier is a perfect illustration of learning from examples.

When knn_model.fit(X_train, y_train) is called, the model doesn’t build complex rules or equations. It simply stores all the training data (X_train and y_train).

When we call knn_model.predict(new_user_features), the model does the following:

It calculates the distance between the new_user_features and all the stored training examples.
It identifies the k (in our case, 3) nearest training examples to the new_user_features.
It then looks at the user_liked_movie labels of these 3 nearest neighbors.
It assigns the most frequent label among these neighbors as the prediction for the new_user.

This shows how it directly “learns” from the examples themselves at prediction time, rather than compiling a general model beforehand.

Conclusion: A Spectrum of Intelligence

Machine learning offers a powerful toolkit for extracting insights and making decisions from data. While each learning paradigm has its distinct characteristics:

Supervised Learning excels when you have clear, labeled data and want to predict a specific outcome (classification or regression). It’s about direct prediction based on known answers.
Unsupervised Learning is your go-to when you have unlabeled data and want to uncover hidden structures, patterns, or segments. It’s about discovery and organization.
Reinforcement Learning is ideal for sequential decision-making in dynamic environments, where an agent learns through interaction and feedback (rewards/penalties). It’s about strategic learning over time.
Learning from Examples (Instance-Based Learning), like KNN, is a specific approach often found within supervised or unsupervised contexts. It stands out because models directly refer to stored instances for predictions, making them intuitive and often easy to interpret, though sometimes less efficient for very large datasets.

These different forms of machine learning are not mutually exclusive; they often complement each other in real-world applications. Understanding these distinctions is crucial for choosing the right approach to solve your data-driven problems effectively.

Which type of machine learning are you most excited to explore in your next project?

Edit this post on GitHub

Alberto Ivo