The most important assumption in statistics- IID [Math Mondays]
Everything from basic 8th grade probability to complex Probabilistic Graphs rely on this assumption
To learn more about the newsletter, check our detailed About Page + FAQs
To help me understand you better, please fill out this anonymous, 2-min survey. If you liked this post, make sure you hit the heart icon in this email.
Happy Monday my amazing readers,
Today we will be covering one of the most important assumptions in all of Math. Statistical Analysis would not be possible without it. Even if you aren’t directly a mathematician/work in Data Analysis, it is crucial that you know this. There are a lot of misunderstandings about this concept. This can lead to terrible consequences since people can make faulty decisions based on flawed reasoning. For example, the notorious gambler’s fallacy can be avoided by understanding this concept. Most human biases in fact, can be improved/fixed by integrating this concept into your thoughts. So you don’t want to miss this. Don’t worry, we’ll keep things simple.
The IID assumption states that all the data samples taken from sampling/observations are IID. IID stands for Independently and Identically Distributed. What do these mean? And how can you spot these? Let’s cover that-
Key Highlights-
What is IID- A distribution is IID if the samples in it are Independently and Identically Distributed.
Independent Distribution- The result of one sample doesn’t influence another. Think of rolling the dice. One die roll doesn’t impact the other.
Identical Distribution- Two samples are from an identical distribution when their behavior across a large number of samples will be identical. Take 2 fair dice. If we rolled them both a 1000000 times and tracked the numbers they throw up, we would imagine that their results would be (close to) identical. Thus they come from identical distributions.
Why this matters- Many statistical sampling and inference techniques need to assume this to make deductions. The video below covers the IID assumption in Machine Learning
What people get wrong- People often make decisions because they incorrectly attribute IID traits to non-IID distributions. Or as is the case with Gambler’s Fallacy, vice versa. Both mistakes you want to avoid.
How to avoid this- Good news, this doesn’t require any fancy math. Any time you find yourself making an assumption about any probability distribution, take a step back and do a double take. You’ll be surprised, how many times our intuitive brains misattribute traits. A great book to learn more about this is the exceptional book, Thinking Fast and Slow.
To those of interested in more Mathy understanding of the subject, take a look at the following definitions
I got it from this excellent presentation. Check it out if you have aspirations of getting into Data Analysis/AI. It will help you a bunch.
I created Technology Interviews Made Simple using new techniques discovered through tutoring multiple people into top tech firms. The newsletter is designed to help you succeed, saving you from hours wasted on the Leetcode grind. I have a 100% satisfaction policy, so you can try it out at no risk to you. You can read the FAQs and find out more here. Use the button below to get 50% off for upto a whole year.
This is a limited-time offer, that I’m running to celebrate my almost 50 questions completed. Take advantage of it by clicking the button above.
Before proceeding, if you have enjoyed this post so far, please make sure you like it (the little heart button in the email/post). I also have a special request for you.
***Special Request***
This newsletter has received a lot of love. If you haven’t already, I would really appreciate it if you could take 5 seconds to let Substack know that they should feature this publication on their pages. This will allow more people to see the newsletter.
There is a simple form in Substack that you can fill up for it. Here it is. Thank you.
https://docs.google.com/forms/d/e/1FAIpQLScs-yyToUvWUXIUuIfxz17dmZfzpNp5g7Gw7JUgzbFEhSxsvw/viewform
To get your Substack URL, follow the following steps-
Open - https://substack.com/
If you haven’t already, log in with your email.
In the top right corner, you will see your icon. Click on it. You will see the drop-down. Click on your name/profile. That will show you the link.
You will be redirected to your URL. Please put that in to the survey. Appreciate your help.
In the comments below, share what topic you want to focus on. I’d be interested in learning and will cover them. To learn more about the newsletter, check our detailed About Page + FAQs
If you liked this post, make sure you fill out this survey. It’s anonymous and will take 2 minutes of your time. It will help me understand you better, allowing for better content.
https://forms.gle/XfTXSjnC8W2wR9qT9
I see you living the dream.
Go kill all and Stay Woke,
Devansh <3
To make sure you get the most out of Math Mondays, make sure you’re checking in the rest of the days as well. Leverage all the techniques I have discovered through my successful tutoring to easily succeed in your interviews and save your time and energy by joining the premium subscribers down below. Get a discount (for a whole year) using the button below
Reach out to me on:
Instagram: https://www.instagram.com/iseethings404/
Message me on Twitter: https://twitter.com/Machine01776819
My LinkedIn: https://www.linkedin.com/in/devansh-devansh-516004168/
My content:
Read my articles: https://rb.gy/zn1aiu
My YouTube: https://rb.gy/88iwdd
Get a free stock on Robinhood. No risk to you, so not using the link is losing free money: https://join.robinhood.com/fnud75