Probability Basics

  • Learn basic probability concepts essential for data analytics.
  • Probability Concepts

    What is Probability?

    Probability measures how likely an event is to happen.

    Probability=Favorable OutcomesTotal Possible OutcomesProbability = \frac{\text{Favorable Outcomes}}{\text{Total Possible Outcomes}}Probability=Total Possible OutcomesFavorable Outcomes​

    Probability value is always between 0 and 1

    • 0 → Impossible event

    • 1 → Certain event


    Example 1: Tossing a Coin

    Possible outcomes: Head (H), Tail (T)

    P(H)=1/2=0.5P(H) = 1/2 = 0.5P(H)=1/2=0.5


    Example 2: Rolling a Dice

    Total outcomes = 6

    P(rolling 4)=1/6P(rolling\ 4) = 1/6P(rolling 4)=⅙


    Important Terms

    • Experiment → Process (e.g., tossing coin)

    • Outcome → Result (Head or Tail)

    • Event → Set of outcomes

    • Sample Space → All possible outcomes



    Probability Rules

    1. Addition Rule

    Used when events are mutually exclusive.

    P(A or B)=P(A)+P(B)P(A \text{ or } B) = P(A) + P(B)P(A or B)=P(A)+P(B)

    Example:
    Probability of rolling 1 or 2 on dice:

    1/6+1/6=2/6=1/31/6 + 1/6 = 2/6 = 1/31/6+1/6=2/6=⅓


    2. Multiplication Rule

    Used for independent events.

    P(A and B)=P(A)×P(B)P(A \text{ and } B) = P(A) \times P(B)P(A and B)=P(A)×P(B)

    Example:
    Probability of getting Head twice:

    1/2×1/2=1/41/2 \times 1/2 = 1/41/2×1/2=¼


    3. Complement Rule

    P(A′)=1−P(A)P(A') = 1 - P(A)P(A′)=1−P(A)

    Example:
    If P(Rain) = 0.3
    Then P(No Rain) = 0.7



    Introduction to Probability Distributions

    A Probability Distribution shows how probabilities are distributed over values of a random variable.

    There are two main types:


    A) Discrete Distribution

    Used for countable outcomes.

    Examples:

    • Dice roll

    • Number of students

    Common distribution:

    • Binomial Distribution


    B) Continuous Distribution

    Used for measurable data.

    Examples:

    • Height

    • Weight

    • Salary

    Common distribution:

    • Normal Distribution (Bell Curve)

    Properties of Normal Distribution:

    • Symmetrical shape

    • Mean = Median = Mode

    • 68% data within 1 SD

    • 95% data within 2 SD



    Applications in Data Analytics

    1. Risk Analysis

    Used in finance to measure investment risk.

    2. Machine Learning

    Used in classification & prediction models.

    3. A/B Testing

    Helps compare two marketing strategies.

    4. Forecasting

    Predict sales, demand, trends.

    5. Fraud Detection

    Identify unusual behavior using probability models.