CS 215 - Data Interpretation and Analysis

Instructor: Ajit Rajwade
Office: SIA-218, KReSIT Building
Email:

Lecture Timings: Slot 3: Monday 10:35 am to 11:30 am, Tuesday 11:35 am to 12:30 pm, Thursday 8:30 am to 9:25 am

Lecture Venue: LA 002

Instructor Office hours: Tuesday 12:30 to 1:15 pm in LA 002. (Feel free to send queries over email or moodle)

Teaching Assistants:
  • Srijan Das
  • Ayush Pratap Singh
  • Manivannan N
  • Abhay Raj
  • Mohammad Kashif Khan
  • Anirban Paul
  • Kumar Rajnish
  • Some UG TAs will be also be allocated

Topics to be covered (tentative list)


Intended Audience

2nd year BTech students from CSE

Learning Materials and Textbooks

Computational Resources


Grading Policy (tenative)


Other Policies


Tutorials

Quizzes

Lecture Schedule:


Date

Content of the Lecture

Assignments/Readings/Notes

28/07
  • Introduction, course overview and course policies
29/07
    Descriptive Statistics
  • Terminology: population, sample, discrete and continuous valued attributes
  • Frequency tables, frequency polyongs, line diagrams, pie charts, relative frequency tables
  • Histograms with examples for image intensity histograms, image gradient histograms
  • Histogram binning problem
  • Data summarization: Mean and Median
31/07
  • Data summarization: mean and median
  • "Proof" that median minimizes the sum of absolute deviations - using calculus
  • Proof that median minimizes the sum of absolute deviations, without using calculus
  • Concept of quantile/percentile
  • Calculation of mean and median in different ways from histogram or cumulative plots
  • Standard deviation and variance, some applications
  • Two-sided Chebyshev inequality with proof; One-side Chebyshev inequality (Chebyshev-Cantelli inequality)
4/8
  • Two-sided Chebyshev inequality with proof; One-side Chebyshev inequality (Chebyshev-Cantelli inequality)
  • Concept of correlation coefficient and formula for it; proof that its value lies from -1 to +1
  • Correlation coefficient: properties; uncentered correlation coefficient; limitations of correlation coefficient and Anscombe's quartet
  • Correlation and causation
5/8 Discrete Probability
  • Discrete probability: sample space, event, composition of events: union, intersection, complement, exclusive or, De Morgan's laws
  • Boole's and Bonferroni's inequalities
  • Conditional probability, Bayes rule, False Positive Paradox
  • Independent and mutually exclusive events
  • Birthday paradox
7/8
  • Independent and mutually exclusive events
  • Birthday paradox
MATLAB Tutorial
  • Code vectorization: vectors and matrix operations
  • Plotting graphs, scatterplots, images in MATLAB
  • Some functions for computing statistical quantities