data-science

machine-learning
python
deep-learning
data-analysis
data-visualization
data-engineering
scikit-learn
artificial-intelligence
statistics
pandas
mlops
jupyter-notebook
data
tensorflow
pytorch
natural-language-processing
ai
automl
computer-vision
data-quality
workflow
nlp
hacktoberfest
big-data
data-mining
finance
pipeline
react
analytics
data-analytics
neural-network
kubernetes

scikit-learn/scikit-learn
346 days ago · 57.4k

scikit-learn: machine learning in Python

pandas-dev/pandas
346 days ago · 41.2k

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

GokuMohandas/Made-With-ML
347 days ago · 35.1k

Learn how to design, develop, deploy and iterate on production-grade ML applications.

streamlit/streamlit
347 days ago · 30.2k

Streamlit — A faster way to build and share data apps.

ray-project/ray
347 days ago · 29.9k

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

AMAI-GmbH/AI-Expert-Roadmap
346 days ago · 27.9k

Roadmap to becoming an Artificial Intelligence Expert in 2022

gradio-app/gradio
347 days ago · 26.3k

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Lightning-AI/pytorch-lightning
345 days ago · 26.2k

Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.

CamDavidsonPilon/Probabilistic-Programming-and-Bayesian-Methods-for-Hackers
346 days ago · 26.2k

aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)

donnemartin/data-science-ipython-notebooks
347 days ago · 26.1k

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

eriklindernoren/ML-From-Scratch
345 days ago · 22.9k

Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learning.

d2l-ai/d2l-en
346 days ago · 20.9k

Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.

matplotlib/matplotlib
347 days ago · 18.9k

matplotlib: plotting with Python

qax-os/excelize
346 days ago · 16.8k

Go language library for reading and writing Microsoft Excel™ (XLAM / XLSM / XLSX / XLTM / XLTX) spreadsheets

ml-tooling/best-of-ml-python
347 days ago · 15.1k

🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.

ashishpatel26/500-AI-Machine-learning-Deep-learning-Computer-vision-NLP-Projects-with-code
346 days ago · 14.7k

500 AI Machine learning Deep learning Computer vision NLP Projects with code

PrefectHQ/prefect
345 days ago · 14.1k

Prefect is a workflow orchestration tool empowering developers to build, observe, and react to data pipelines

dair-ai/ML-YouTube-Courses
346 days ago · 13.5k

📺 Discover the latest machine learning / AI courses on YouTube.

rasbt/python-machine-learning-book
347 days ago · 12.1k

The "Python Machine Learning (1st edition)" book code repository and info resource

OpenRefine/OpenRefine
348 days ago · 10.3k

OpenRefine is a free, open source power tool for working with messy data and improving it

fastai/numerical-linear-algebra
347 days ago · 9.9k

Free online textbook of Jupyter notebooks for fast.ai Computational Linear Algebra course

Sinaptik-AI/pandas-ai
346 days ago · 9.8k

Chat with your data (SQL, CSV, pandas, polars, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.

aws/amazon-sagemaker-examples
347 days ago · 9.3k

Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.

goplus/gop
346 days ago · 8.7k

The Go+ programming language is designed for engineering, STEM education, and data science

akfamily/akshare
346 days ago · 7.9k

AKShare is an elegant and simple financial data interface library for Python, built for human beings! An open-source financial data interface library.

tangyudi/Ai-Learn
346 days ago · 7.9k

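An artificial intelligence learning roadmap that compiles nearly 200 hands-on case studies and projects, with free accompanying course materials, for beginners starting from zero and for job-oriented practice! Covers: Python, mathematics, machine learning, data analysis, deep learning, computer vision, natural language processing, PyTorch tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-intelligence python tensorflow tensorflow2 caffe keras pytorch algorithm numpy pandas matplotlib seaborn nlp cv, and other popular fields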

catboost/catboost
346 days ago · 7.6k

A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.

mrdbourke/machine-learning-roadmap
345 days ago · 7.1k

A roadmap connecting many of the most important concepts in machine learning, how to learn them and what tools to use to perform them.

rasbt/python-machine-learning-book-2nd-edition
348 days ago · 7.0k

The "Python Machine Learning (2nd edition)" book code repository and info resource

firmai/industry-machine-learning
347 days ago · 7.0k

A curated list of applied machine learning and data science notebooks and libraries across different industries (by @firmai)

h2oai/h2o-3
346 days ago · 6.6k

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.

mage-ai/mage-ai
345 days ago · 6.5k

🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.

mahmoud/boltons
347 days ago · 6.4k

🔩 Like builtins, but boltons. 250+ constructs, recipes, and snippets which extend (and rely on nothing but) the Python standard library. Nothing like Michael Bolton.

javascriptdata/danfojs
346 days ago · 4.6k

Danfo.js is an open-source JavaScript library providing high-performance, intuitive, and easy-to-use data structures for manipulating and processing structured data.

flyteorg/flyte
345 days ago · 4.5k

Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.

goq/telegram-list
347 days ago · 4.4k

List of Telegram groups, channels & bots // List of interesting Telegram groups, channels, and bots // List of chats for programmers

hadley/r4ds
347 days ago · 4.3k

R for data science: a book

nteract/hydrogen
350 days ago · 3.9k

Run code interactively, inspect data, and plot. All the power of Jupyter kernels, inside your favorite text editor.

marimo-team/marimo
346 days ago · 3.3k

A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.

lancedb/lance
345 days ago · 3.1k

Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, with more integrations coming.

evidence-dev/evidence
345 days ago · 2.9k

Business intelligence as code: build fast, interactive data visualizations in pure SQL and markdown.

diffgram/diffgram
348 days ago · 1.8k

The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.

GoogleCloudPlatform/vertex-ai-samples
347 days ago · 1.2k

Sample code and notebooks for Vertex AI, the end-to-end machine learning platform on Google Cloud