In the previous article, we explored distance-based clustering with k-Means.
Then, to improve how distance is measured, we added the variance, which led us to the Mahalanobis distance.
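As a reminder, in standard notation the Mahalanobis distance of a point x to a cluster with mean μ and covariance Σ is:

$$d_M(x,\mu) = \sqrt{(x-\mu)^\top \Sigma^{-1} (x-\mu)}$$

In one dimension this is just |x − μ| / σ: the usual distance to the mean, rescaled by the standard deviation.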
So, if k-Means is the unsupervised version of the Nearest Centroid classifier, then the natural question is:
What is the unsupervised version of QDA?
Which means that, as in QDA, each cluster now needs to be described not only by its mean, but also by its variance (and we also have to add covariance terms as soon as there is more than one feature). But here everything is learned without labels.
So you see the idea, right?
And well, the name of this model is the Gaussian Mixture Model (GMM)…
GMM and the names of these models…
As is often the case, the names of the models come from historical reasons. They are not always designed to highlight the connections between models, when those models were not developed together.
Different researchers, different periods, different use cases… and we end up with names that sometimes hide the true structure behind the ideas.
Here, the name “Gaussian Mixture Model” simply means that the data is represented as a mixture of several Gaussian distributions.
If we followed the same naming logic as k-Means, it would have been clearer to call it something like k-Gaussian Mixture.
Because, in practice, instead of only using the means, we add the variance. We could just as well use the Mahalanobis distance, or another weighted distance built from both means and variances. But the Gaussian distribution gives us probabilities, which are easier to interpret.
So we choose a number k of Gaussian components.
And by the way, GMM is not the only one.
In fact, the entire machine learning framework is actually much more recent than most of the models it contains. Most of these methods were originally developed in statistics, signal processing, econometrics, or pattern recognition.
Then, much later, the field we now call “machine learning” emerged and regrouped all these models under one umbrella. But the names did not change.
So today we use a mix of vocabularies coming from different eras, different communities, and different intentions.
This is why the relationships between models are not always obvious when you look only at the names.
If we had to rename everything in a modern, unified machine-learning style, the landscape would actually be much clearer:
- GMM would become k-Gaussian Clustering
- QDA would become Nearest Gaussian Classifier
- LDA, well, Nearest Gaussian Classifier with the same variance across classes.
And suddenly, all the links appear:
- k-Means ↔ Nearest Centroid
- GMM ↔ Nearest Gaussian (QDA)
This is why GMM is so natural after k-Means. If k-Means groups points by their closest centroid, then GMM groups them by their closest Gaussian shape.
Why spend a whole section discussing the names?
Well, the truth is that, since we have already covered the k-means algorithm, and we have already made the transition from the Nearest Centroid classifier to QDA, we already know everything about this algorithm, and the training algorithm will not change…
And what is the NAME of this training algorithm?
Oh, Lloyd’s algorithm.
Actually, before k-means was called that, it was simply known as Lloyd’s algorithm, proposed by Stuart Lloyd in 1957. Only later did the machine learning community rename it “k-means”.
And since that algorithm only manipulated the means, we need another name, right?
You see where this is going: the Expectation-Maximization algorithm!
EM is simply the general form of Lloyd’s idea. Lloyd updates the means; EM updates everything: means, variances, weights, and probabilities.
So, you already know everything about GMM!
But since my article is called “GMM in Excel”, I cannot end it here…
GMM in 1 Dimension
Let us start with this simple dataset, the same one we used for k-means: 1, 2, 3, 11, 12, 13
Hmm, the two Gaussians will end up with the same variances here. So think about playing with other numbers in Excel!
And we naturally want 2 clusters.
Here are the different steps.
Initialization
We start with guesses for the means, variances, and weights.
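Concretely, with 2 components, the parameters to guess are (standard notation; the actual starting values in your spreadsheet can be anything reasonable):

$$\pi_1, \mu_1, \sigma_1^2 \quad \text{and} \quad \pi_2, \mu_2, \sigma_2^2, \qquad \pi_1 + \pi_2 = 1$$

where π is the weight of a component (the share of points it is expected to explain), μ its mean, and σ² its variance.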
Expectation step (E-step)
For each point, we compute how likely it is to belong to each Gaussian.
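In formulas (standard EM notation, nothing Excel-specific), the “responsibility” of Gaussian k for point x_i is its weighted density, normalized over the components:

$$\gamma_{ik} = \frac{\pi_k\,\mathcal{N}(x_i \mid \mu_k, \sigma_k^2)}{\sum_j \pi_j\,\mathcal{N}(x_i \mid \mu_j, \sigma_j^2)}, \qquad \mathcal{N}(x \mid \mu, \sigma^2) = \frac{1}{\sqrt{2\pi\sigma^2}}\exp\!\left(-\frac{(x-\mu)^2}{2\sigma^2}\right)$$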

Maximization step (M-step)
Using these probabilities, we update the means, variances, and weights.
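With the responsibilities from the E-step, each update is just a weighted average:

$$N_k = \sum_i \gamma_{ik}, \qquad \mu_k = \frac{\sum_i \gamma_{ik}\,x_i}{N_k}, \qquad \sigma_k^2 = \frac{\sum_i \gamma_{ik}\,(x_i-\mu_k)^2}{N_k}, \qquad \pi_k = \frac{N_k}{n}$$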

Iteration
We repeat the E-step and the M-step until the parameters stabilize.

Each step is very simple once you see the formulas.
You will see that EM is nothing more than updating averages, variances, and probabilities.
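If you want to double-check your spreadsheet against code, here is a minimal NumPy sketch of the same 1D EM loop (the starting guesses are arbitrary, chosen only for illustration; the article itself does all of this directly in Excel):

```python
import numpy as np

x = np.array([1, 2, 3, 11, 12, 13], dtype=float)

# Initial guesses: two components, arbitrary means, unit variances, equal weights
mu = np.array([1.0, 4.0])
var = np.array([1.0, 1.0])
pi = np.array([0.5, 0.5])

def gaussian_pdf(x, mu, var):
    return np.exp(-(x - mu) ** 2 / (2 * var)) / np.sqrt(2 * np.pi * var)

for _ in range(50):
    # E-step: responsibility of each Gaussian for each point, shape (6, 2)
    dens = pi * gaussian_pdf(x[:, None], mu, var)
    gamma = dens / dens.sum(axis=1, keepdims=True)

    # M-step: weighted means, variances and weights
    nk = gamma.sum(axis=0)
    mu = (gamma * x[:, None]).sum(axis=0) / nk
    var = (gamma * (x[:, None] - mu) ** 2).sum(axis=0) / nk
    pi = nk / len(x)

print(mu, var, pi)  # the means settle near 2 and 12
```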
We can also do some visualization to see how the Gaussian curves move during the iterations.
At first, the two Gaussian curves overlap heavily because the initial means and variances are just guesses.
The curves slowly separate, adjust their widths, and finally settle exactly on the two groups of points.
By plotting the Gaussian curves at each iteration, you can literally watch the model learn:
- the means slide toward the centers of the data
- the variances shrink to match the spread of each group
- the overlap disappears
- the final shapes match the structure of the dataset
This visual evolution is extremely helpful for intuition. Once you see the curves move, EM is no longer an abstract algorithm. It becomes a dynamic process you can observe step by step.
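If you want to reproduce that animation outside of Excel, a small (hypothetical) matplotlib sketch is enough: save the parameters at a few iterations of the loop above and plot the two densities each time. The snapshot values below are illustrative, not results from the spreadsheet.

```python
import numpy as np
import matplotlib.pyplot as plt

def gaussian_pdf(x, mu, var):
    return np.exp(-(x - mu) ** 2 / (2 * var)) / np.sqrt(2 * np.pi * var)

data = np.array([1, 2, 3, 11, 12, 13], dtype=float)
grid = np.linspace(-2, 16, 400)

# (means, variances) of the two components at two moments of training (illustrative values)
snapshots = {
    "initial guess": ([1.0, 4.0], [1.0, 1.0]),
    "after convergence": ([2.0, 12.0], [0.67, 0.67]),
}

for label, (means, variances) in snapshots.items():
    plt.figure()
    plt.title(f"Gaussian components: {label}")
    plt.plot(data, np.zeros_like(data), "kx")  # the data points on the x-axis
    for m, v in zip(means, variances):
        plt.plot(grid, gaussian_pdf(grid, m, v))
    plt.show()
```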

GMM in 2 Dimensions
The logic is exactly the same as in 1D. Nothing new conceptually. We simply extend the formulas…
Instead of having one feature per point, we now have two.
Each Gaussian must now learn:
- a mean for x1
- a mean for x2
- a variance for x1
- a variance for x2
- AND a covariance term between the two features.
Once you write the formulas in Excel, you will see that the process stays exactly the same:
Well, the truth is that if you look at the screenshot, you might think: “Wow, the formula is so long!” And this is not even all of it.

But don’t be fooled. The formula is long only because we write out the 2-dimensional Gaussian density explicitly:
- one part for the distance in x1
- one part for the distance in x2
- the covariance term
- the normalization constant
Nothing more.
It is simply the density formula expanded cell by cell.
Long to type, but perfectly understandable once you see the structure: a weighted distance, inside an exponential, divided by a constant involving the determinant.
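Written out in full, that structure is the standard bivariate Gaussian density (means μ1, μ2, variances σ1², σ2², and the covariance expressed through the correlation ρ):

$$\mathcal{N}(x_1, x_2) = \frac{1}{2\pi\,\sigma_1\sigma_2\sqrt{1-\rho^2}}\,\exp\!\left(-\frac{1}{2(1-\rho^2)}\left[\frac{(x_1-\mu_1)^2}{\sigma_1^2} - \frac{2\rho\,(x_1-\mu_1)(x_2-\mu_2)}{\sigma_1\sigma_2} + \frac{(x_2-\mu_2)^2}{\sigma_2^2}\right]\right)$$

Each bracketed piece is one of the bullet points above, and the prefactor is the normalization constant, which is where the determinant σ1²σ2²(1−ρ²) shows up.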
So yes, the formula looks huge… but the idea behind it is extremely simple.
Conclusion
k-Means gives hard boundaries.
GMM gives probabilities.
Once the EM formulas are written in Excel, the model becomes easy to follow: the means move, the variances adjust, and the Gaussians naturally settle around the data.
GMM is simply the next logical step after k-Means, offering a more flexible way to represent clusters and their shapes.
