/

r

python
java
machine-learning
data-science
big-data
gbm
jupyter
php
c
scala
spark
gbdt
decision-trees
gradient-boosting
distributed
kaggle
lightgbm
datascience
ruby
swift
javascript
rstudio
julia
typescript
csharp
statistical-learning
cpp
sql
jdbc
forecasting
gbrt
parallel

apache/spark
502日前37.8k

Apache Spark - A unified analytics engine for large-scale data processing

facebook/prophet
502日前17.5k

Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.

microsoft/LightGBM
502日前15.9k

A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.

FavioVazquez/ds-cheatsheets
503日前12.6k

List of Data Science Cheatsheets to rule the world

HugoBlox/hugo-blox-builder
502日前7.6k

😍 EASILY BUILD THE WEBSITE YOU WANT - NO CODE, JUST MARKDOWN BLOCKS! 使用块轻松创建任何类型的网站 - 无需代码。 一个应用程序,没有依赖项,没有 JS

catboost/catboost
502日前7.6k

A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.

h2oai/h2o-3
502日前6.6k

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.

tidyverse/ggplot2
502日前6.2k

An implementation of the Grammar of Graphics in R

nteract/papermill
503日前5.5k

📚 Parameterize, execute, and analyze notebooks

cxli233/FriendsDontLetFriends
502日前5.4k

Friends don't let friends make certain types of data visualization - What are they and why are they bad.

rstudio/shiny
503日前5.2k

Easy interactive web applications with R

tidyverse/dplyr
502日前4.6k

dplyr: A grammar of data manipulation

hadley/r4ds
503日前4.3k

R for data science: a book

h2oai/wave
502日前3.8k

Realtime Web Apps and Dashboards for Python and R

BayesWitnesses/m2cgen
503日前2.7k

Transform ML models into a native code (Java, C, Python, Go, JavaScript, Visual Basic, C#, R, PowerShell, PHP, Dart, Haskell, Ruby, F#, Rust) with zero dependencies

szcf-weiya/ESL-CN
503日前2.3k

The Elements of Statistical Learning (ESL)的中文翻译、代码实现及其习题解答。

kaxap/arl
503日前1.9k

lists of most popular repositories for most favoured programming languages (according to StackOverflow)