## Syllabus

##### Introduction to Python

- Concepts of Python Programming
- Configuration of Development Environment
- Using the Python Interpreter
- Numbers and Strings

##### More on Python

- Tuples and Lists
- Functions
- Control Flow and Loops
- Dictionaries

##### Data Science Fundamentals

- Introduction to Data Science
- Real-World Use Cases of Data Science
- Walkthrough of Data Types
- Data Science Project Lifecycle

##### Introduction to NumPy

- Basics of NumPy Arrays
- Mathematical operations in NumPy
- NumPy Array Manipulation
- NumPy Array Broadcasting

##### Data Manipulation with Pandas

- Data Structures in Pandas: Series and DataFrames
- Data Cleaning in Pandas
- Data Manipulation in Pandas
- Handling Missing Values in Dataset

##### Data Visualization in Python

- Plotting Basic Charts in Python
- Data Visualization with Matplotlib
- Statistical Data Visualization with Seaborn
- Coding Sessions using Matplotlib, Seaborn

##### Exploratory Data Analysis

- Introduction to Exploratory Data Analysis (EDA)
- Plots to Explore Relationship between Two Variables
- Histograms and Box Plots to Explore a Variable
- Heat Maps and Pair Plots to Explore Correlations
- Perform EDA to Explore Survival using Titanic Dataset

##### Introduction to Machine Learning

- What is Machine Learning?
- Use Cases of Machine Learning
- Types of Machine Learning
- Machine Learning Workflow

##### Linear Regression

- Introduction to Linear Regression
- Use Cases of Linear Regression
- How to fit a Linear Regression Model?
- Evaluating and Interpreting Results from Linear Regression models
- Predict Bike-Sharing Demand

##### Logistic Regression

- Introduction to Logistic Regression
- Logistic Regression Use Cases
- Understand the Use of Odds & the Logit Function
- Predicting Credit Card Default Cases

##### Decision Trees & Random Forest

- Introduction to Decision Trees & Random Forest
- Using Ensemble Methods in Decision Trees
- Applications of Random Forest
- Predict Passenger Survival Using Titanic Dataset

##### Model Evaluation Techniques

- Metrics and Model Selection in Machine Learning
- Matrix for Predictions
- Measures of Model Evaluation: Sensitivity
- Use AUC-ROC Curve to Decide Best Model
- Applying Model Evaluation Techniques

##### Dimensionality Reduction using PCA

- Introduction to Curse of Dimensionality
- What is Dimensionality Reduction?
- Applications of Principal Component Analysis
- Optimize Model Using PCA

##### K-Nearest Neighbours

- Introduction to KNN
- Calculate Neighbours Using Distance Measures
- Find Optimal Value of K in KNN Method
- Advantages & Disadvantages of KNN
- Classify Phishing Site Data Using Close Neighbour

##### Naive Bayes Classifier

- Introduction to Naive Bayes Classification
- Refresher on Probability Theory
- Applications of Naive Bayes Algorithm
- Classify Spam Emails based on Probability

##### K-Means Clustering

- Introduction to K-means Clustering
- Decide Clusters by Adjusting Centroids
- Find Optimal 'K value' in K-means
- Understand Applications of Clustering
- Segment Hands in Pokerdata and Segment

##### Support Vector Machines

- Introduction to SVM
- Figure Decision Boundaries Using Support Vectors
- Identify Hyperplane in SVM
- Applications of SVM in Machine Learning
- Predicting Wine Quality Using SVM

## Certification

#### Executive Program in Data Science Technology Certified By **Microsoft**

**Happy Clients** Our success is measured by results.

**Projects** Our focus is on delivering better content.

**Years of Experience** In imparting quality training across verticals.

**Students** Placed in top MNCs.

## Platforms Covered

#### Python

Python is an interpreted, high-level, general-purpose programming language.

#### Pandas

Pandas is a software library written for the Python programming language for data manipulation and analysis. In particular, it offers data structures and operations for manipulating numerical tables and time series.

#### NumPy

NumPy is a library for the Python programming language, adding support for large, multi-dimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.
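As a minimal sketch of what that description means in practice, the snippet below builds a small two-dimensional array and applies a few mathematical operations to it. The variable names and values are illustrative assumptions, not part of the course material.

```python
import numpy as np

# A 2-D array (matrix) with 2 rows and 3 columns; the values are made up.
matrix = np.array([[1.0, 2.0, 3.0],
                   [4.0, 5.0, 6.0]])

# Element-wise math runs over the whole array without an explicit loop.
doubled = matrix * 2
roots = np.sqrt(matrix)

# Reduction along an axis: axis=0 collapses the rows, giving one mean per column.
column_means = matrix.mean(axis=0)

print(matrix.shape)    # (2, 3)
print(doubled[1, 2])   # 12.0
print(column_means)    # [2.5 3.5 4.5]
```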

#### Jupyter Notebook

Project Jupyter is a nonprofit organization created to "develop open-source software, open-standards, and services for interactive computing across dozens of programming languages".

#### Scikit-Learn

Scikit-learn is a free software machine learning library for the Python programming language. It features various classification, regression and clustering algorithms including support vector machines, random forests, gradient boosting, k-means and DBSCAN, and is designed to interoperate with the Python numerical and scientific libraries NumPy and SciPy.

## Testimonials

### Pankaj Singh

Learning is very good here. Trainers are very good for Azure and AWS. Completed my AWS & Azure training.

### Harish Pandey

I have completed my Azure technologies training. The training session was good. Thanks to my trainer. Thanks, Vepsun team.

### Ankit Singh

Best institute offering an AWS & Azure course at a good cost. The trainer was always ready to clear our doubts and support us. They also have a good student coordinator.

### Nishant Sinha

Enrolled here for the Linux course. The trainers are highly qualified with great experience, and the staff, Kavita and Alka, were quite helpful.

### Shiva Reddy

Artificial Training content was very helpful for me to get the job. Teaching and explanation were very good. Good experience overall.

## Instructors and Experts

Learn from India's leading faculty and industry leaders.

#### Sanjeev Singh

EXP 18+

#### Sameer

EXP 15+

#### Satwik Muthappa

EXP 15+

#### Mujaheed

EXP 12+

## Program Fee

### Data Science

#### INR. 2000*

\* Inclusive of all taxes

- 4 sessions/classes
- Online live classes

### Data Science

#### INR. 24,000*

\* Inclusive of all taxes

- Training
- Single Certification
- Online - live Classes
- No Cost EMI Available

### Data Science

#### INR. 40,000*

\* Inclusive of all taxes

- Training
- Dual Certification
- Online - live Classes
- No Cost EMI Available

## Frequently Asked Questions

#### What is Data Science?

Data science is a broad field that covers the processes, theories, concepts, tools and technologies used to review, analyze and extract valuable knowledge and information from raw data. It is geared toward helping individuals and organizations make better decisions from stored, consumed and managed data. Data science was formerly known as datalogy.

#### What is the difference between supervised and unsupervised machine learning?

**Supervised Machine learning:**
Supervised learning trains a model on data that contains both an input variable (say, X) and an output variable (say, Y); the algorithm learns a function that maps the input to the output.
That is, Y = f(X).

**Unsupervised Machine learning:**
In unsupervised learning, only the input data (say, X) is available; there is no corresponding output variable.
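The difference can be sketched with NumPy alone. In this illustrative example (all data values are made up), the supervised half fits a straight line to labelled pairs, and the unsupervised half groups unlabelled points with a small hand-written two-cluster k-means loop. Both are simplified stand-ins chosen for brevity, not the only way to do either task.

```python
import numpy as np

# Supervised: inputs X come with known outputs Y, and we learn Y = f(X).
X = np.array([1.0, 2.0, 3.0, 4.0])
Y = np.array([3.0, 5.0, 7.0, 9.0])           # generated from Y = 2*X + 1
slope, intercept = np.polyfit(X, Y, deg=1)   # least-squares straight line

def f(x):
    """Learned mapping from input to output."""
    return slope * x + intercept

print(round(float(f(5.0)), 2))  # 11.0

# Unsupervised: only inputs, no labels. Group the points into 2 clusters.
points = np.array([1.0, 1.2, 0.8, 9.0, 9.5, 8.5])
centroids = np.array([points.min(), points.max()])  # simple starting guess
for _ in range(10):
    # Assign each point to its nearest centroid, then move each centroid
    # to the mean of the points assigned to it.
    labels = np.abs(points[:, None] - centroids[None, :]).argmin(axis=1)
    centroids = np.array([points[labels == k].mean() for k in range(2)])

print(labels)  # [0 0 0 1 1 1]
```

Note that the supervised model can be checked against the known Y values, while the cluster labels 0 and 1 carry no meaning beyond "these points belong together".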

#### What is pruning in a Decision Tree?

Pruning is the process of removing the sub-nodes of a decision node. It is the opposite of splitting.
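A toy sketch in plain Python shows the idea. The tree layout (nested dictionaries), the feature names, and the pruning rule used here (collapse a decision node when both of its children predict the same class) are all assumptions made for illustration; libraries typically use other criteria, such as a cost-complexity penalty.

```python
# A decision node is a dict with a test and two children;
# a leaf is a dict that holds only a prediction.
tree = {
    "feature": "age", "threshold": 30,
    "left": {"prediction": "no"},
    "right": {
        "feature": "income", "threshold": 50,
        "left": {"prediction": "yes"},
        "right": {"prediction": "yes"},
    },
}

def is_leaf(node):
    return "prediction" in node

def prune(node):
    """Replace a decision node with a leaf when its sub-nodes add nothing."""
    if is_leaf(node):
        return node
    left = prune(node["left"])
    right = prune(node["right"])
    if is_leaf(left) and is_leaf(right) and left["prediction"] == right["prediction"]:
        # Both branches agree, so the split is redundant: remove the sub-nodes.
        return {"prediction": left["prediction"]}
    return {**node, "left": left, "right": right}

def count_nodes(node):
    if is_leaf(node):
        return 1
    return 1 + count_nodes(node["left"]) + count_nodes(node["right"])

pruned = prune(tree)
print(count_nodes(tree), count_nodes(pruned))  # 5 3
```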

#### What is Random Forest?

Random forest is a versatile machine learning method capable of performing both regression and classification tasks. It can also be used for dimensionality reduction and for handling missing values and outliers. It is a type of ensemble learning method, where a group of weak models combine to form a powerful model.
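A minimal classification sketch with scikit-learn illustrates the ensemble idea. The synthetic dataset, the split ratio, and the choice of 50 trees are assumptions picked for the example, not recommendations.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

# Synthetic data: 300 samples, 8 features, of which 4 carry signal.
X, y = make_classification(n_samples=300, n_features=8,
                           n_informative=4, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0)

# The forest is an ensemble: 50 decision trees, each trained on a
# bootstrap sample, whose votes are combined into one prediction.
forest = RandomForestClassifier(n_estimators=50, random_state=0)
forest.fit(X_train, y_train)

accuracy = forest.score(X_test, y_test)   # fraction of correct test predictions
print(len(forest.estimators_))            # 50
print(forest.feature_importances_.shape)  # (8,)
```

The `feature_importances_` scores are one way a forest can guide feature selection: low-scoring features are candidates for removal.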

#### What is deep learning?

Deep learning is a machine learning technique. It teaches a computer to filter inputs through layers to learn how to predict and classify information. Observations can be in the form of images, text, or sound.

## Contact Us

We offer more advanced technologies than any other computer and business training company. Businesses and individuals can choose from our course offerings, delivered by experts.

1st Floor, 104, S R Arcade, 6th Cross, Marathahalli, Bangalore - 560037.

info@vepsun.com

+91 90-363-63007

+91 90-353-53007