UNIT 1:
Big Data and Data Science, A Project on Data Analytics
A Little History on Methodologies for Data Analytics
Data Analytics - Types, Tools and Applications
KDD Process, CRISP-DM Methodology
UNIT 2:
Descriptive Univariate Analysis, Descriptive Bivariate Analysis
Descriptive Multivariate Analysis - Multivariate Frequencies, Multivariate Data Visualization, Multivariate Statistics
Infographics and Word Clouds
Data Quality - Missing Values, Redundant Data, Inconsistent Data, Noisy Data, Outliers
UNIT 3:
Distance Measures for Non-conventional Attributes
Clustering Techniques - K-means, Centroids and Distance Measures, DBSCAN
UNIT 4:
Predictive Performance Measures for Classification
Distance-based Learning Algorithms - K-nearest Neighbor Algorithms, Case-based Reasoning
Probabilistic Classification Algorithms - Logistic Regression Algorithm, Naive Bayes Algorithm
UNIT 5:
DA Applications for Text, Web and Social Media
Working with Texts, Recommender Systems, Social Network Analysis