data-science
keras-team/keras347日前60.3k
Deep Learning for humans
scikit-learn/scikit-learn346日前57.4k
scikit-learn: machine learning in Python
apache/superset351日前56.6k
Apache Superset is a Data Visualization and Data Exploration Platform
pandas-dev/pandas346日前41.2k
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
GokuMohandas/Made-With-ML347日前35.1k
Learn how to design, develop, deploy and iterate on production-grade ML applications.
apache/airflow345日前33.6k
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
streamlit/streamlit347日前30.2k
Streamlit — A faster way to build and share data apps.
ray-project/ray347日前29.9k
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
explosion/spaCy346日前28.3k
💫 Industrial-strength Natural Language Processing (NLP) in Python
AMAI-GmbH/AI-Expert-Roadmap346日前27.9k
Roadmap to becoming an Artificial Intelligence Expert in 2022
gradio-app/gradio347日前26.3k
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Lightning-AI/pytorch-lightning345日前26.2k
Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.
CamDavidsonPilon/Probabilistic-Programming-and-Bayesian-Methods-for-Hackers346日前26.2k
aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)
donnemartin/data-science-ipython-notebooks347日前26.1k
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
eugeneyan/applied-ml345日前25.7k
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
eriklindernoren/ML-From-Scratch345日前22.9k
Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learning.
d2l-ai/d2l-en346日前20.9k
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
plotly/dash347日前20.1k
Data Apps & Dashboards for Python. No JavaScript Required.
matplotlib/matplotlib347日前18.9k
matplotlib: plotting with Python
recommenders-team/recommenders347日前17.6k
Best Practices on Recommendation Systems
qax-os/excelize346日前16.8k
Go language library for reading and writing Microsoft Excel™ (XLAM / XLSM / XLSX / XLTM / XLTX) spreadsheets
ml-tooling/best-of-ml-python347日前15.1k
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
piskvorky/gensim347日前15.0k
Topic Modelling for Humans
ashishpatel26/500-AI-Machine-learning-Deep-learning-Computer-vision-NLP-Projects-with-code346日前14.7k
500 AI Machine learning Deep learning Computer vision NLP Projects with code
PrefectHQ/prefect345日前14.1k
Prefect is a workflow orchestration tool empowering developers to build, observe, and react to data pipelines
dair-ai/ML-YouTube-Courses346日前13.5k
📺 Discover the latest machine learning / AI courses on YouTube.
rasbt/python-machine-learning-book347日前12.1k
The "Python Machine Learning (1st edition)" book code repository and info resource
ydataai/ydata-profiling347日前11.8k
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
stefan-jansen/machine-learning-for-trading346日前11.3k
Code for Machine Learning for Algorithmic Trading, 2nd edition.
OpenRefine/OpenRefine348日前10.3k
OpenRefine is a free, open source power tool for working with messy data and improving it
fastai/numerical-linear-algebra347日前9.9k
Free online textbook of Jupyter notebooks for fast.ai Computational Linear Algebra course
Sinaptik-AI/pandas-ai346日前9.8k
Chat with your data (SQL, CSV, pandas, polars, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
dagster-io/dagster345日前9.7k
An orchestration platform for the development, production, and observation of data assets.
EpistasisLab/tpot347日前9.4k
A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.
statsmodels/statsmodels346日前9.3k
Statsmodels: statistical modeling and econometrics in Python
Yorko/mlcourse.ai347日前9.3k
Open Machine Learning Course
aws/amazon-sagemaker-examples347日前9.3k
Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.
great-expectations/great_expectations346日前9.3k
Always know what to expect from your data.
microsoft/computervision-recipes347日前9.2k
Best Practices, code samples, and documentation for Computer Vision.
goplus/gop346日前8.7k
The Go+ programming language is designed for engineering, STEM education, and data science
akfamily/akshare346日前7.9k
AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
tangyudi/Ai-Learn346日前7.9k
人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Python,数学,机器学习,数据分析,深度学习,计算机视觉,自然语言处理,PyTorch tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-intelligence python tensorflow tensorflow2 caffe keras pytorch algorithm numpy pandas matplotlib seaborn nlp cv等热门领域
catboost/catboost346日前7.6k
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
Netflix/metaflow347日前7.4k
:rocket: Build and manage real-life ML, AI, and data science projects with ease!
sktime/sktime346日前7.2k
A unified framework for machine learning with time series
mrdbourke/machine-learning-roadmap345日前7.1k
A roadmap connecting many of the most important concepts in machine learning, how to learn them and what tools to use to perform them.
rasbt/python-machine-learning-book-2nd-edition348日前7.0k
The "Python Machine Learning (2nd edition)" book code repository and info resource
firmai/industry-machine-learning347日前7.0k
A curated list of applied machine learning and data science notebooks and libraries across different industries (by @firmai)
alteryx/featuretools347日前7.0k
An open source python library for automated feature engineering
autogluon/autogluon346日前6.8k
AutoGluon: AutoML for Image, Text, Time Series, and Tabular Data
h2oai/h2o-3346日前6.6k
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
mage-ai/mage-ai345日前6.5k
🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.
mahmoud/boltons347日前6.4k
🔩 Like builtins, but boltons. 250+ constructs, recipes, and snippets which extend (and rely on nothing but) the Python standard library. Nothing like Michael Bolton.
nteract/nteract347日前6.1k
📘 The interactive computing suite for you! ✨
feast-dev/feast345日前5.1k
Feature Store for Machine Learning
javascriptdata/danfojs346日前4.6k
Danfo.js is an open source, JavaScript library providing high performance, intuitive, and easy to use data structures for manipulating and processing structured data.
flyteorg/flyte345日前4.5k
Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
man-group/dtale347日前4.4k
Visualizer for pandas data structures
goq/telegram-list347日前4.4k
List of telegram groups, channels & bots // Список интересных групп, каналов и ботов телеграма // Список чатов для программистов
iterative/cml355日前3.9k
♾️ CML - Continuous Machine Learning | CI/CD for ML
nteract/hydrogen350日前3.9k
:atom: Run code interactively, inspect data, and plot. All the power of Jupyter kernels, inside your favorite text editor.
polakowo/vectorbt346日前3.6k
Find your trading edge, using the fastest engine for backtesting, algorithmic trading, and research.
fastai/fastpages356日前3.5k
An easy to use blogging platform, with enhanced support for Jupyter Notebooks.
marimo-team/marimo346日前3.3k
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
lancedb/lance345日前3.1k
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, with more integrations coming..
evidence-dev/evidence345日前2.9k
Business intelligence as code: build fast, interactive data visualizations in pure SQL and markdown..
diffgram/diffgram348日前1.8k
The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.
pydoit/doit346日前1.8k
task management & automation tool
alan-turing-institute/MLJ.jl348日前1.7k
A Julia machine learning framework
MilesCranmer/PySR346日前1.6k
High-Performance Symbolic Regression in Python and Julia
kantord/just-dashboard346日前1.6k
:bar_chart: :clipboard: Dashboards using YAML or JSON files
GoogleCloudPlatform/vertex-ai-samples347日前1.2k
Sample code and notebooks for Vertex AI, the end-to-end machine learning platform on Google Cloud