List python libraries use for data science

Data Processing and Model Deployment

NumPy



NumPy is a library used for numerical computing in Python. It provides fast array computations and is used for performing mathematical operations on arrays and matrices.
Website: https://numpy.org/

Pandas



Pandas is a library used for data manipulation and analysis. It is used for reading, writing, and analyzing data in a tabular format.
Website: https://pandas.pydata.org/

Scikit-learn



Scikit-learn is a machine learning library used for building predictive models. It provides a wide range of tools for data preprocessing, feature extraction, and model selection.
Website: https://scikit-learn.org/

TensorFlow



TensorFlow is a popular library for building machine learning models, especially neural networks. It is widely used for deep learning tasks and provides tools for building, training, and evaluating models.
Website: https://www.tensorflow.org/

Keras



Keras is a high-level neural networks API, written in Python and capable of running on top of TensorFlow, CNTK, or Theano. It is used for building and training neural networks.
Website: https://keras.io/

NLTK



The Natural Language Toolkit (NLTK) is a library used for natural language processing (NLP) tasks such as tokenization, stemming, and parsing.
Website: https://www.nltk.org/

SciPy


SciPy is a library used for scientific computing in Python. It provides modules for optimization, integration, linear algebra, and statistics.

Statsmodels



Statsmodels is a library used for statistical modeling and data analysis. It provides tools for regression analysis, time-series analysis, and hypothesis testing.
Website: https://www.statsmodels.org/

PyCaret



PyCaret is a fully accessible machine learning package for model deployment and data processing. It allows you to save time because it is a low-code library. It's a user-friendly machine learning library that will help you run end-to-end machine learning tests, whether you're trying to suggest missing values, analyzing categorical data, engineering features, tuning hyperparameters, or generating ensemble models.
Website: https://pycaret.org/

OpenCV



Licensed under the BSD, OpenCV is a free machine learning and computer vision library. It offers a shared architecture for computer vision applications to streamline the implementation of computer vision in commercial products.
Website: https://opencv.org/

Data Mining and Data Scraping

Scrapy


An open source and collaborative framework for extracting the data you need from websites. In a fast, simple, yet extensible way.

BeautifulSoup



BeautifulSoup is a Python data scraping and mining library that scrapes HTML and XML source data. It allows data scientists to develop a web crawler that crawls across websites. BeautifulSoup can retrieve data and structure it in the desired format. The scraped HTML data includes a lot of scrambled web data that users can't interpret. Its most recent version, BS4 (BeautifulSoup 4), arranges the jumbled web data into easy-to-understand XML structures, allowing for data analysis. BeautifulSoup identifies encodings automatically and smoothly interprets HTML documents, including those with special characters. We can search through a parsed document and find what we're looking for in it.
Website:

Data Visualization

Seaborn



Seaborn is a Python data visualization library based on Matplotlib. It provides more advanced and visually appealing visualizations than Matplotlib.

Matplotlib



Matplotlib is a data visualization library that provides a wide range of graphs, charts, and plots.

Ggplot


ggplot is a Python implementation of the grammar of graphics. It is not intended to be a feature-for-feature port of ggplot2 for R--though there is much greatness in ggplot2, the Python world could stand to benefit from it. So there will be feature overlap, but not neccessarily mimicry (after all, R is a little weird).

Plotly



plotly.py is an interactive, open-source, and browser-based graphing library for Python sparkles Built on top of plotly.js, plotly.py is a high-level, declarative charting library. plotly.js ships with over 30 chart types, including scientific charts, 3D graphs, statistical charts, SVG maps, financial charts, and more. Plotly graphs can be viewed in Jupyter notebooks, standalone HTML files, or integrated into Dash applications.

Altair



Altair is a declarative statistical visualization library for Python. Its simple, friendly and consistent API, built on top of the powerful Vega-Lite grammar, empowers you to spend less time writing code and more time exploring your data.

Autoviz



Automatically Visualize any dataset, any size with a single line of code. Now you can save these interactive charts as HTML files automatically with the "html" setting.