**Aim Of Concept:**

- To understand the Python libraries for data science.
- To understand the basic statistical and probability measures for data science.
- To learn descriptive analytics on the benchmark data sets.
- To apply correlation and regression analytics on standard data sets.
- To present and interpret data using visualization packages in Python.

**CS3361 - Data Science Laboratory Syllabus**

**List Of Experiments:**

- Download, install and explore the features of NumPy, SciPy, Jupyter, Statsmodels and Pandas packages.
- Working with NumPy arrays
- Working with Pandas data frames
- Reading data from text files, Excel, and the web and exploring various commands for doing descriptive analytics on the Iris data set.
- Use the diabetes data set from UCI and the Pima Indians Diabetes data set for performing the following:

a. Univariate analysis: Frequency, Mean, Median, Mode, Variance, Standard Deviation, Skewness and Kurtosis.

b. Bivariate analysis: Linear and logistic regression modeling

c. Multiple Regression analysis

d. Also, compare the results of the above analysis for the two data sets.

- Apply and explore various plotting functions on UCI data sets.

a. Normal curves

b. Density and contour plots

c. Correlation and scatter plots

d. Histograms

e. Three dimensional plotting

- Visualizing Geographic Data with Basemap
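
The univariate measures named in the diabetes experiment above (frequency, mean, median, mode, variance, standard deviation, skewness and kurtosis) could be computed along the following lines. This is only a minimal sketch using pandas: the `values` series is a hypothetical toy sample made up for illustration, not data from the UCI or Pima Indians data sets, and the `summary` dictionary is just one convenient way to collect the results.

```python
import pandas as pd

# Hypothetical toy sample for illustration only; in the lab this would be
# one numeric column of the diabetes or Pima Indians data set.
values = pd.Series([2, 4, 4, 5, 7, 8], name="measurement")

summary = {
    "frequency": values.value_counts().to_dict(),  # count of each distinct value
    "mean": values.mean(),
    "median": values.median(),
    "mode": values.mode().tolist(),                # may contain several values
    "variance": values.var(),                      # sample variance (ddof=1)
    "std_dev": values.std(),                       # sample standard deviation (ddof=1)
    "skewness": values.skew(),
    "kurtosis": values.kurt(),                     # excess kurtosis
}

for name, result in summary.items():
    print(f"{name}: {result}")
```

Note that pandas defaults to the sample (not population) variance and standard deviation, which matters when comparing results against NumPy, whose `var` and `std` default to the population versions.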

