/

dplyr

dplyr: A grammar of data manipulation

最終更新日:354日前
4.6k

antlr/antlr4
355日前16.1k

ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files.

BayesWitnesses/m2cgen
355日前2.7k

Transform ML models into a native code (Java, C, Python, Go, JavaScript, Visual Basic, C#, R, PowerShell, PHP, Dart, Haskell, Ruby, F#, Rust) with zero dependencies

apache/spark
354日前37.8k

Apache Spark - A unified analytics engine for large-scale data processing

nteract/papermill
355日前5.5k

📚 Parameterize, execute, and analyze notebooks

szcf-weiya/ESL-CN
355日前2.3k

The Elements of Statistical Learning (ESL)的中文翻译、代码实现及其习题解答。

gchq/CyberChef
354日前24.3k

The Cyber Swiss Army Knife - a web app for encryption, encoding, compression and data analysis

kaxap/arl
355日前1.9k

lists of most popular repositories for most favoured programming languages (according to StackOverflow)

facebook/prophet
354日前17.5k

Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.

microsoft/LightGBM
354日前15.9k

A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.

FavioVazquez/ds-cheatsheets
355日前12.6k

List of Data Science Cheatsheets to rule the world

HugoBlox/hugo-blox-builder
354日前7.6k

😍 EASILY BUILD THE WEBSITE YOU WANT - NO CODE, JUST MARKDOWN BLOCKS! 使用块轻松创建任何类型的网站 - 无需代码。 一个应用程序,没有依赖项,没有 JS

catboost/catboost
354日前7.6k

A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.

h2oai/h2o-3
354日前6.6k

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.

tidyverse/ggplot2
354日前6.2k

An implementation of the Grammar of Graphics in R

cxli233/FriendsDontLetFriends
354日前5.4k

Friends don't let friends make certain types of data visualization - What are they and why are they bad.

rstudio/shiny
355日前5.2k

Easy interactive web applications with R

hadley/r4ds
355日前4.3k

R for data science: a book

h2oai/wave
354日前3.8k

Realtime Web Apps and Dashboards for Python and R

javascriptdata/danfojs
354日前4.6k

Danfo.js is an open source, JavaScript library providing high performance, intuitive, and easy to use data structures for manipulating and processing structured data.