Probability Basics
-
Learn basic probability concepts essential for data analytics.
Probability Concepts
What is Probability?
Probability measures how likely an event is to happen.
Probability=Favorable OutcomesTotal Possible OutcomesProbability = \frac{\text{Favorable Outcomes}}{\text{Total Possible Outcomes}}Probability=Total Possible OutcomesFavorable Outcomes
Probability value is always between 0 and 1
0 → Impossible event
1 → Certain event
Example 1: Tossing a Coin
Possible outcomes: Head (H), Tail (T)
P(H)=1/2=0.5P(H) = 1/2 = 0.5P(H)=1/2=0.5
Example 2: Rolling a Dice
Total outcomes = 6
P(rolling 4)=1/6P(rolling\ 4) = 1/6P(rolling 4)=⅙
Important Terms
Experiment → Process (e.g., tossing coin)
Outcome → Result (Head or Tail)
Event → Set of outcomes
Sample Space → All possible outcomes
Probability Rules
1. Addition Rule
Used when events are mutually exclusive.
P(A or B)=P(A)+P(B)P(A \text{ or } B) = P(A) + P(B)P(A or B)=P(A)+P(B)
Example:
Probability of rolling 1 or 2 on dice:1/6+1/6=2/6=1/31/6 + 1/6 = 2/6 = 1/31/6+1/6=2/6=⅓
2. Multiplication Rule
Used for independent events.
P(A and B)=P(A)×P(B)P(A \text{ and } B) = P(A) \times P(B)P(A and B)=P(A)×P(B)
Example:
Probability of getting Head twice:1/2×1/2=1/41/2 \times 1/2 = 1/41/2×1/2=¼
3. Complement Rule
P(A′)=1−P(A)P(A') = 1 - P(A)P(A′)=1−P(A)
Example:
If P(Rain) = 0.3
Then P(No Rain) = 0.7Introduction to Probability Distributions
A Probability Distribution shows how probabilities are distributed over values of a random variable.
There are two main types:
A) Discrete Distribution
Used for countable outcomes.
Examples:
Dice roll
Number of students
Common distribution:
Binomial Distribution
B) Continuous Distribution
Used for measurable data.
Examples:
Height
Weight
Salary
Common distribution:
Normal Distribution (Bell Curve)
Properties of Normal Distribution:
Symmetrical shape
Mean = Median = Mode
68% data within 1 SD
95% data within 2 SD
Applications in Data Analytics
1. Risk Analysis
Used in finance to measure investment risk.
2. Machine Learning
Used in classification & prediction models.
3. A/B Testing
Helps compare two marketing strategies.
4. Forecasting
Predict sales, demand, trends.
5. Fraud Detection
Identify unusual behavior using probability models.