www.free-education.in is a platform where you can get pdf notes from 6th to 12th class notes, General Knowledge post, Engineering post, Career Guidelines , English Speaking Trick , How to crack interview and lots more.Statistics Notes & NCERT Solutions
Introduction to Statistics
Ungrouped data is data in its original or raw form. The observations are not classified into groups.
For example, the ages of everyone present in a classroom of kindergarten kids with the teacher is as follows:
3, 3, 4, 3, 5, 4, 3, 3, 4, 3, 3, 3, 3, 4, 3, 27.
This data shows that there is one adult present in this class and that is the teacher. Ungrouped data is easy to work when the data set is small.
In grouped data, observations are organized in groups.
For example, a class of students got different marks in a school exam. The data is tabulated as follows:
|No. of Students||13||9||36||32||10|
This shows how many students got the particular mark range. Grouped data is easier to work with when a large amount of data is present.
Frequency is the number of times a particular observation occurs in data.
Data can be grouped into class intervals such that all observations in that range belong to that class.
Class width = upper class limit – lower class limit
Direct method of finding mean
Step 1: Classify the data into intervals and find the corresponding frequency of each class.
Step 2: Find the class mark by taking the midpoint of the upper and lower class limits.
Step 3: Tabulate the product of class mark and its corresponding frequency for each class. Calculate their sum (∑xifi).
Step 4: Divide the above sum by the sum of frequencies (∑fi) to get the mean.
Assumed mean method of finding mean
Important relations between methods of finding mean
- All three methods of finding mean yield the same result.
- Step deviation method is easier to apply if all the deviations have a common factor.
- Assumed mean method and step deviation method are simplified versions of the direct method.
Finding the Median of Grouped Data when class Intervals are not given
Step 1: Tabulate the observations and the corresponding frequency in ascending or descending order.
Step 2: Add the cumulative frequency column to the table by finding the cumulative frequency up to each observation.
Step 3: If the number of observations is odd, the median is the observation whose cumulative frequency is just greater than or equal to (n+1)/2
If the number of observations is even, the median is the average of observations whose cumulative frequency is just greater than or equal to n/2 and (n/2)+1.
To know more about Median, visit here.
Cumulative frequency is obtained by adding all the frequencies up to a certain point.
Finding median for Grouped Data when class Intervals are given
Cumulative Frequency distribution of less than type
Cumulative frequency of the less than type indicates the number of observations which are less than or equal to a particular observation.
Cumulative Frequency distribution of more than type
Cumulative frequency of more than type indicates the number of observations which are greater than or equal to a particular observation.
To know more about Cumulative Frequency Distribution, visit here.
Visualising formula for median graphically
Median from Cumulative Frequency Curve
Step 1: Identify the median class.
Step 2: Mark cumulative frequencies on the y-axis and observations on the x-axis corresponding to the median class.
Step 3: Draw a straight line graph joining the extremes of class and cumulative frequencies.
Step 4: Identify the point on the graph corresponding to cf=n/2
Step 5: Drop a perpendicular from this point on to the x-axis.
Ogive of less than type
The graph of a cumulative frequency distribution of the less than type is called an ‘ogive of the less than type’.
Ogive of more than type
The graph of a cumulative frequency distribution of the more than type is called an ‘ogive of the more than type’.
To know more about Ogive, visit here.
Relation between the less than and more than type curves
The point of intersection of the ogives of more than and less than types gives the median of the grouped frequency distribution.
Finding mode for Grouped Data when class intervals are not given
In grouped data without class intervals, the observation having the largest frequency is the mode.
Finding mode for Ungrouped Data
For ungrouped data, the mode can be found out by counting the observations and using tally marks to construct a frequency table.
The observation having the largest frequency is the mode.
Finding mode for Grouped Data when class intervals are given
For, grouped data, the class having the highest frequency is called the modal class. The mode can be calculated using the following formula. The formula is valid for equal class intervals and when the modal class is unique.
l= lower limit of modal class
h= class width
f1= frequency of the modal class
f0= frequency of the class preceding the modal class
f2= frequency of the class succeeding the modal class.
To know more about Mode, visit here.
Visualising formula for mode graphically
Step 1: Express the class intervals and frequencies as a histogram.
Step 2: Join the top corners of the modal class to the diagonally opposite corners of the adjacent classes
Step 3: Drop a perpendicular from the point of intersection of the above on the horizontal x-axis.
Measures of Central Tendency for Grouped Data
i) Mean is the average of a set of observations.
ii) Median is the middle value of a set of observations.
iii) A mode is the most common observation.
To know more about Central Tendency, visit here.
The best-suited measure of central tendency in different cases and the Empirical relationship between them
i) The mean takes into account all the observations and lies between the extremes. It enables us to compare distributions.
ii) In problems where individual observations are not important, and we wish to find out a ‘typical’ observation where half the observations are below and half the observations are above, the median is more appropriate. Median disregards the extreme values.
iii) In situations which require establishing the most frequent value or most popular item, the mode is the best choice.
Mean, mode and median are connected by the empirical relationship
3 Median = Mode + 2 Mean