Link Search Menu Expand Document

Introduction

Python was introduced as a General Purpose language. It was initially used for Frontend, Backend, WebDevelopment. But Now it is mainly used for DataAnalytics due to the library specially built for this Analytics and Visualization.

Why Python than R

Python can handle large amount of Data, Where R becomes slower to handle because it runs on RAM. But python runs in HardDisk where we dont need to worry about storage.

Download Python

Link for downloding Python- https://www.python.org/downloads/ I installed python for MAC, you can install based on your OS. Since i installed python, i installed Jupyter Notebook, which is a web based interactive platform for running python code.

Commands for Jypyter notebook installation

cd /usr/local/bin (python installation dir in my sys)
pip install jupyterlab
pip install notebook

Command to launch Jupyter notebook

jupyter notebook

Python libraries

Below are Python Libraries. Check the libraries and its purposes. When you know the purpose, it would be easy when we include them during programming.

Pandas

Pandas is a dataframe in python. It is used for structured data operations and manipulations. Also used for DataPreprocessing and Modelling. Helps to have your input in structured way.

SciPy

Scientific Python- Built on top of NumPy. It can be used for optimization and Advanced fourier transform.

SymPy

Used for Symbolic Computations

NumPy

Numerical Python-Deals with Numerical Values, Contains basic Linear Algebra function, Fourier Transform, Advanced Random Number Capability, Integration tools with low level languages like c and c++.

Matplotlib

Designed for Visualization. But only for static images.

Scikit Learn

Helps to develop ML Models. Built on top of NumPy, SciPy, MatplotLib. It contains ML Algorithm.

Blaze

Used to access data from different DB.

Scrapy

Helps to start something from start to end. Helps to scrap the social websites from start and dig through it.

Bokeh

Helps to generate interactive plots.

Seaborn

Helps to develelop interactive visualization.

Statsmodels

Helps to devlop Statistical Models, and Tests(ANOVA, CHI-Square)

os

Used for IN/OUT operations and File Transfer operations.

NLTK

Designed for Text Mining.

BeautifulSoup

Used for scraping the websites. This is lower to Scrapy.

Requests

Used for accessing the web for IN and OUT operations

Hope it helps to some extent. We will start with python programming from our next.