How Machine Learning Really Works

In recent years, machine learning has evolved from a statistical tool to one of the most transformative technologies of our time. You’ll find it in self-driving cars, medical algorithms, Netflix recommendations, and — of course — in the chatbot you probably talk to every day.

But how does it really work? And how did we go from telling machines what to look at, to letting them figure it out on their own?

📌 It All Starts with a Question

The starting point is always the same: what do we want to predict?

It could be:

Tomorrow’s stock price
Whether a photo shows a cat or a dog
Or whether a patient has a certain medical condition

The first step is to define the task clearly. Then we need examples: known input–output pairs, like labeled images or translated sentences.

🔧 Back in the Day: Hand-Crafted Features

In the beginning, training a model meant turning every object into a list of numbers — a feature vector.
If you were working with molecules, for example, you’d write a mini-questionnaire:

Does it have this chemical group? ✅
Does it contain this atom? ❌

Each answer became a number in a vector. These numbers went into the model to help it make predictions, like whether the molecule is toxic.

It worked — but required deep domain expertise, and the risk of injecting bias or noise was always present.

🤖 Today’s Models Learn Everything — End-to-End

Nowadays, we feed models raw data (images, text, graphs) and let them learn both:

how to represent it, and
how to use it to make predictions.

For images: convolutional neural networks (CNNs) or transformers.
For text: transformers (yes, the ones behind GPT).
For molecules or graphs: graph neural networks.

The magic is in the end-to-end learning: the model teaches itself how to go from raw input to useful output.

🧭 Generalization Is the Goal

We don’t just want a model that performs well on examples it has seen.
We want it to generalize — to work on new, unseen data.

That’s why we split our dataset: one part is used to train the model, and another (called the validation set) acts like a surprise quiz.

If it performs well on the quiz, we’re on the right track.

🔄 Welcome to the Pre-training Era

Until recently, every time you trained a model, you had to start from scratch.
Now there’s a better way: pre-train a model on general-purpose tasks (often self-supervised), then fine-tune it with a small amount of task-specific data.

Examples:

Text: predicting the next word (how GPT was born)
Images: reconstructing missing parts or solving contrastive tasks
Multimodal: matching images with captions

These tasks are scalable, automatic, and — frankly — work incredibly well.

🌍 Foundation Models: One Model to Rule Them All

These pre-trained giants are called foundation models — large, general, and versatile.
You train them once on billions of examples, then adapt them to a wide range of tasks.

With minimal fine-tuning, they can:

Translate
Answer questions
Write code
Analyze images
Reason across modalities

They’ve become the new base layer of modern AI.

⚡️ Why This Leap Was Possible

This revolution wasn’t just architectural — it was infrastructural.

Access to massive datasets (web-scale corpora, public image collections, biomedical graphs)
Powerful hardware (GPUs, TPUs, clusters of machines)
Distributed training and open-source tooling (like PyTorch and JAX)

Together, they enabled the scaling of models from thousands to hundreds of billions of parameters.

⚠️ The Other Side: Bias, Cost, and Control

As powerful as these models are, they come with serious challenges:

Bias in training data can lead to biased predictions
Interpretability remains limited — we often don’t know why a model makes a choice
Environmental cost: training huge models can require energy equivalent to hundreds of tons of CO₂
Overreliance: when humans trust opaque models without questioning their outputs

Understanding these trade-offs is essential for building responsible AI.

🔍 What’s Next? The Frontier of ML Research

The field is still moving fast. Key directions include:

Efficient transformers: faster and cheaper variants for low-resource settings
Mixture-of-Experts models: only parts of the model are active at once
Retrieval-augmented generation: models that “look things up” before answering
Agentic behavior: systems that can plan, explore, and interact autonomously

Machine learning is moving from passive pattern recognition toward active decision-making and reasoning.

✅ TL;DR

Machine learning has evolved from “classifying cats and dogs” using hand-crafted features…
…to understanding the world from raw data, at massive scale, with general-purpose foundation models that can adapt across domains.

We’re no longer just teaching machines to label the world —
we’re teaching them to interpret it.
And perhaps, one day, to help us reimagine it.

From Hand-Crafted Features to Foundation Models: How Machine Learning Learned to Understand the World

📌 It All Starts with a Question

🔧 Back in the Day: Hand-Crafted Features

🤖 Today’s Models Learn Everything — End-to-End

🧭 Generalization Is the Goal

🔄 Welcome to the Pre-training Era

🌍 Foundation Models: One Model to Rule Them All

⚡️ Why This Leap Was Possible

⚠️ The Other Side: Bias, Cost, and Control

🔍 What’s Next? The Frontier of ML Research

✅ TL;DR

AI & Collective Intelligence in Organizations

How AI Can Enhance Business Strategy: An Analysis of Porter’s Framework

More from this blog

Conclusion: The Sustainability of Architecture: Summary, Risk, and Continuous Growth

The Architect's Career: Navigating Specialization and Technical Leadership

The Journey Continues: The Architect as a Perpetual Change Agent

Measuring Success: When Architecture Truly Worked

Developing the Architects Team: Authority, Responsibility, and Support

Command Palette

📌 It All Starts with a Question

🔧 Back in the Day: Hand-Crafted Features

🤖 Today’s Models Learn Everything — End-to-End

🧭 Generalization Is the Goal

🔄 Welcome to the Pre-training Era

🌍 Foundation Models: One Model to Rule Them All

⚡️ Why This Leap Was Possible

⚠️ The Other Side: Bias, Cost, and Control

🔍 What’s Next? The Frontier of ML Research

✅ TL;DR

AI & Collective Intelligence in Organizations

How AI Can Enhance Business Strategy: An Analysis of Porter’s Framework

More from this blog