/

BigData-Notes

大数据入门指南 :star:

最終更新日:354日前
15.0k

bamlab/generator-rn-toolbox
362日前1.2k

The React Native Generator to bootstrap your apps

donnemartin/data-science-ipython-notebooks
355日前26.1k

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

deeplearning4j/deeplearning4j
355日前13.3k

Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learning using automatic differentiation.

lk-geimfari/awesomo
356日前9.2k

Cool open source projects. Choose your project and get involved in Open Source development now.

izhangzhihao/intellij-rainbow-brackets
355日前4.3k

🌈Rainbow Brackets for IntelliJ based IDEs/Android Studio/HUAWEI DevEco Studio/Fleet

plausible/analytics
355日前17.6k

Simple, open source, lightweight (< 1 KB) and privacy-friendly web analytics alternative to Google Analytics.

livebook-dev/livebook
355日前4.2k

Automate code & data workflows with interactive Elixir notebooks

ueberauth/guardian
356日前3.4k

Elixir Authentication

miniMAC/magic
354日前8.3k

CSS3 Animations with special effects

hmemcpy/milewski-ctfp-pdf
355日前10.6k

Bartosz Milewski's 'Category Theory for Programmers' unofficial PDF and LaTeX source

apache/spark
354日前37.8k

Apache Spark - A unified analytics engine for large-scale data processing

nteract/papermill
355日前5.5k

📚 Parameterize, execute, and analyze notebooks

thechangelog/changelog.com
357日前2.6k

Changelog is news and podcast for developers. This is our open source platform.

geekyouth/SZT-bigdata
354日前2.1k

深圳地铁大数据客流分析系统🚇🚄🌟

phoenixframework/phoenix_live_dashboard
356日前1.9k

Realtime dashboard with metrics, request logging, plus storage, OS and VM insights

xusenlinzy/api-for-open-llm
354日前1.6k

Openai style api for open large language models, using LLMs just as chatgpt! Support for LLaMA, LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, Xverse, SqlCoder, CodeLLaMA, ChatGLM, ChatGLM2, ChatGLM3 etc. 开源大模型的统一后端接口

beam-community/elixir-companies
358日前1.6k

A list of companies currently using Elixir in production.

elixir-wallaby/wallaby
354日前1.6k

Concurrent browser tests for your Elixir web apps.

pow-auth/pow
359日前1.5k

Robust, modular, and extendable user authentication system

aesmail/kaffy
358日前1.2k

Powerfully simple admin package for phoenix applications

zoonk/uneebee
354日前1.1k

Platform for creating interactive courses.

phcode-dev/phoenix
355日前1.1k

Phoenix is a modern open-source Code Editor for the web, built for the browser.

hexpm/hexpm
358日前1.0k

API server and website for Hex

mojotech/torch
358日前1.0k

A rapid admin generator for Elixir & Phoenix

JanusGraph/janusgraph
355日前5.0k

JanusGraph: an open-source, distributed graph database

build-trust/ockam
354日前4.3k

Orchestrate end-to-end encryption, cryptographic identities, mutual authentication, and authorization policies between distributed applications – at massive scale.

vector4wang/spring-boot-quick
354日前2.4k

:herb: 基于springboot的快速学习示例,整合自己遇到的开源框架,如:rabbitmq(延迟队列)、Kafka、jpa、redies、oauth2、swagger、jsp、docker、k3s、k3d、k8s、mybatis加解密插件、异常处理、日志输出、多模块开发、多环境打包、缓存cache、爬虫、jwt、GraphQL、dubbo、zookeeper和Async等等:pushpin:

kunalkapadia/express-mongoose-es6-rest-api
358日前2.9k

:collision: A boilerplate application for building RESTful APIs Microservice in Node.js using express and mongoose in ES6 with code coverage and JsonWebToken Authentication

taosdata/TDengine
354日前22.6k

TDengine is an open source, high-performance, cloud native time-series database optimized for Internet of Things (IoT), Connected Cars, Industrial IoT and DevOps.

AobingJava/JavaFamily
354日前35.0k

【Java面试+Java学习指南】 一份涵盖大部分Java程序员所需要掌握的核心知识。

getredash/redash
354日前24.6k

Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.

linkedin/school-of-sre
354日前7.6k

At LinkedIn, we are using this curriculum for onboarding our entry-level talents into the SRE role.

apache/zeppelin
354日前6.2k

Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.

yarnpkg/yarn
354日前41.3k

The 1.x line is frozen - features and bugfixes now happen on https://github.com/yarnpkg/berry

jaredpalmer/tsdx
355日前11.1k

Zero-config CLI for TypeScript package development

sindresorhus/np
354日前7.3k

A better `npm publish`

apache/avro
358日前2.7k

Apache Avro is a data serialization system.

HariSekhon/DevOps-Bash-tools
355日前2.3k

1000+ DevOps Bash Scripts - AWS, GCP, Kubernetes, Docker, CI/CD, APIs, SQL, PostgreSQL, MySQL, Hive, Impala, Kafka, Hadoop, Jenkins, GitHub, GitLab, BitBucket, Azure DevOps, TeamCity, Spotify, MP3, LDAP, Code/Build Linting, pkg mgmt for Linux, Mac, Python, Perl, Ruby, NodeJS, Golang, Advanced dotfiles: .bashrc, .vimrc, .gitconfig, .screenrc, tmux..

thingsboard/thingsboard
354日前15.2k

Open-source IoT Platform - Device management, data collection, processing and visualization.

apache/shardingsphere
354日前19.2k

Distributed SQL transaction & query engine for data sharding, scaling, encryption, and more - on any database.

boltpkg/bolt
364日前2.3k

⚡️ Super-powered JavaScript project management

FavioVazquez/ds-cheatsheets
355日前12.6k

List of Data Science Cheatsheets to rule the world

h2oai/h2o-3
354日前6.6k

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.

apache/kafka
354日前26.8k

Mirror of Apache Kafka

apache/flink
354日前22.8k

Apache Flink

lichess-org/lila
354日前14.3k

♞ lichess.org: the forever free, adless and open source chess server ♞

scala/scala
355日前14.3k

Scala 2 compiler and standard library. Bugs at https://github.com/scala/bug; Scala 3 at https://github.com/lampepfl/dotty

apache/predictionio
356日前12.6k

PredictionIO, a machine learning server for developers and ML engineers.

playframework/playframework
354日前12.5k

The Community Maintained High Velocity Web Framework For Java and Scala.

yahoo/CMAK
354日前11.6k

CMAK is a tool for managing Apache Kafka clusters

gitbucket/gitbucket
355日前9.0k

A Git platform powered by Scala with easy installation, high extensibility & GitHub API compatibility

twitter/finagle
355日前8.7k

A fault tolerant, protocol-agnostic RPC system

Angel-ML/angel
358日前6.7k

A Flexible and Powerful Parameter Server for large-scale machine learning

gatling/gatling
355日前6.2k

Modern Load Testing as Code

cube-js/cube
354日前16.9k

📊 Cube — The Semantic Layer for Building Data Applications

Tencent/APIJSON
354日前16.4k

🏆 零代码、全功能、强安全 ORM 库 🚀 后端接口和文档零代码,前端(客户端) 定制返回 JSON 的数据和结构。 🏆 A JSON Transmission Protocol and an ORM Library 🚀 provides APIs and Docs without writing any code.

yudaocode/SpringBoot-Labs
354日前18.1k

一个涵盖六个专栏:Spring Boot 2.X、Spring Cloud、Spring Cloud Alibaba、Dubbo、分布式消息队列、分布式事务的仓库。希望胖友小手一抖,右上角来个 Star,感恩 1024

aalansehaiyang/technology-talk
353日前13.8k

【大厂面试专栏】一份Java程序员需要的技术指南,这里有面试题、系统架构、职场锦囊、主流中间件等,让你成为更牛的自己!

apache/storm
354日前6.5k

Apache Storm

shzlw/poli
360日前1.9k

An easy-to-use BI server built for SQL lovers. Power data analysis in SQL and gain faster business insights.

DataTalksClub/data-engineering-zoomcamp
353日前21.1k

Free Data Engineering course!

mage-ai/mage-ai
353日前6.5k

🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.

risingwavelabs/risingwave
353日前5.9k

Scalable Postgres for stream processing, analytics, and management. KsqlDB and Apache Flink alternative. 🚀 10x more productive. 🚀 10x more cost-efficient.

chen3feng/blade-build
358日前2.0k

Blade is a powerful build system from Tencent, supports many mainstream programming languages, such as C/C++, java, scala, python, protobuf...

com-lihaoyi/mill
366日前1.9k

Your shiny new Java/Scala build tool!

redpanda-data/redpanda
353日前8.5k

Redpanda is a streaming data platform for developers. Kafka API compatible. 10x faster. No ZooKeeper. No JVM!

sbt/sbt
354日前4.7k

sbt, the interactive build tool

kubeshark/kubeshark
353日前10.3k

The API traffic analyzer for Kubernetes providing real-time K8s protocol-level visibility, capturing and monitoring all traffic and payloads going in, out and across containers, pods, nodes and clusters. Inspired by Wireshark, purposely built for Kubernetes

prestodb/presto
353日前15.4k

The official home of the Presto distributed SQL query engine for big data

DataEngineer-io/data-engineer-handbook
353日前6.6k

This is a repo with links to everything you'd ever want to learn about data engineering

vran-dev/PrettyZoo
353日前3.0k

😉 Pretty nice Zookeeper GUI, Support Win / Mac / Linux Platform

apache/dolphinscheduler
353日前11.7k

Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code

TheHive-Project/TheHive
353日前3.1k

TheHive: a Scalable, Open Source and Free Security Incident Response Platform