/

data-engineer-handbook

This is a repo with links to everything you'd ever want to learn about data engineering

最終更新日:578日前
6.6k

TanStack/query
582日前38.5k

🤖 Powerful asynchronous state management, server-state utilities and data fetching for the web. TS/JS, React Query, Solid Query, Svelte Query and Vue Query.

keajs/kea
582日前1.9k

Batteries Included State Management for React

heroku/react-refetch
585日前3.4k

A simple, declarative, and composable way to fetch data for React components

FredericHeem/starhackit
586日前1.3k

StarHackIt: React/Native/Node fullstack starter kit with authentication and authorisation, data backed by SQL, the infrastructure deployed with GruCloud

bitwarden/server
580日前14.0k

The core infrastructure backend (API, database, Docker, etc).

microsoft/azuredatastudio
582日前7.4k

Azure Data Studio is a data management and development tool with connectivity to popular cloud and on-premises databases. Azure Data Studio supports Windows, macOS, and Linux, with immediate capability to connect to Azure SQL and SQL Server. Browse the extension library for more database support options including MySQL, PostgreSQL, and MongoDB.

serhii-londar/open-source-mac-os-apps
580日前38.7k

🚀 Awesome list of open source applications for macOS. https://t.me/s/opensourcemacosapps

metabase/metabase
580日前35.8k

The simplest, fastest way to get business intelligence and analytics to everyone in your company :yum:

lk-geimfari/awesomo
581日前9.2k

Cool open source projects. Choose your project and get involved in Open Source development now.

pingcap/tidb
580日前35.7k

TiDB is an open-source, cloud-native, distributed, MySQL-Compatible database for elastic scale and real-time analytics. Try AI-powered Chat2Query free at : https://tidbcloud.com/free-trial

SheetJS/sheetjs
581日前34.2k

📗 SheetJS Spreadsheet Data Toolkit -- New home https://git.sheetjs.com/SheetJS/sheetjs

cockroachdb/cockroach
580日前28.7k

CockroachDB - the open source, cloud-native distributed SQL database.

electric-sql/electric
581日前4.0k

Local-first sync layer for web and mobile apps. Build reactive, realtime, local-first apps directly on Postgres.

amitshekhariitbhu/Android-Debug-Database
580日前8.3k

A library for debugging android databases and shared preferences - Make Debugging Great Again

OffcierCia/DeFi-Developer-Road-Map
580日前9.3k

DeFi Developer roadmap is a curated Developer handbook which includes a list of the best tools for DApps development, resources and references!

dpgaspar/Flask-AppBuilder
580日前4.5k

Simple and rapid application development framework, built on top of Flask. includes detailed security, auto CRUD generation for your models, google charts and much more. Demo (login with guest/welcome) - http://flaskappbuilder.pythonanywhere.com/

dypsilon/frontend-dev-bookmarks
580日前40.4k

Manually curated collection of resources for frontend web developers.

ascoders/weekly
580日前26.7k

前端精读周刊。帮你理解最前沿、实用的技术。

ellisonleao/magictools
580日前12.6k

:video_game: :pencil: A list of Game Development resources to make magic happen.

cloudquery/cloudquery
580日前5.4k

The open source high performance data integration platform built for developers.

dkhamsing/open-source-ios-apps
579日前38.3k

:iphone: Collaborative List of Open-Source iOS Apps

apache/spark
579日前37.8k

Apache Spark - A unified analytics engine for large-scale data processing

multiprocessio/datastation
581日前2.8k

App to easily query, script, and visualize data from every database, file, and API.

JuliaData/DataFrames.jl
581日前1.7k

In-memory tabular data in Julia

airbytehq/airbyte
579日前13.2k

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

Sinaptik-AI/pandas-ai
579日前9.8k

Chat with your data (SQL, CSV, pandas, polars, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.

borisdj/EFCore.BulkExtensions
580日前3.4k

Entity Framework EF Core efcore Bulk Batch Extensions with BulkCopy in .Net for Insert Update Delete Read (CRUD), Truncate and SaveChanges operations on SQL Server, PostgreSQL, MySQL, SQLite

akfamily/akshare
579日前7.9k

AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库

OdyseeTeam/chainquery
580日前2.3k

Chainquery parses and syncs the LBRY blockchain data into structured SQL

xlucn/oh-my-foss-android
579日前1.6k

个人收集的实用、良心开源安卓软件

SpaceVim/SpaceVim
580日前20.1k

A community-driven modular vim/neovim distribution - The ultimate vimrc

kaxap/arl
580日前1.9k

lists of most popular repositories for most favoured programming languages (according to StackOverflow)

taosdata/TDengine
579日前22.6k

TDengine is an open source, high-performance, cloud native time-series database optimized for Internet of Things (IoT), Connected Cars, Industrial IoT and DevOps.

osquery/osquery
580日前20.9k

SQL powered operating system instrumentation, monitoring, and analytics.

sequelize/sequelize
579日前28.9k

Feature-rich ORM for modern Node.js and TypeScript, it supports PostgreSQL (with JSON and JSONB support), MySQL, MariaDB, SQLite, MS SQL Server, Snowflake, Oracle DB (v6), DB2 and DB2 for IBM i.

ashishpatel26/500-AI-Machine-learning-Deep-learning-Computer-vision-NLP-Projects-with-code
579日前14.7k

500 AI Machine learning Deep learning Computer vision NLP Projects with code

orientechnologies/orientdb
581日前4.7k

OrientDB is the most versatile DBMS supporting Graph, Document, Reactive, Full-Text and Geospatial models in one Multi-Model product. OrientDB can run distributed (Multi-Master), supports SQL, ACID Transactions, Full-Text indexing and Reactive Queries.

Awesome-HarmonyOS/HarmonyOS
579日前19.0k

A curated list of awesome things related to HarmonyOS. 华为鸿蒙操作系统。

apache/avro
583日前2.7k

Apache Avro is a data serialization system.

pocoproject/poco
579日前7.7k

The POCO C++ Libraries are powerful cross-platform C++ libraries for building network- and internet-based applications that run on desktop, server, mobile, IoT, and embedded systems.

timescale/timescaledb
580日前16.2k

An open-source time-series SQL database optimized for fast ingest and complex queries. Packaged as a PostgreSQL extension.

apache/shardingsphere
579日前19.2k

Distributed SQL transaction & query engine for data sharding, scaling, encryption, and more - on any database.

knex/knex
579日前18.5k

A query builder for PostgreSQL, MySQL, CockroachDB, SQL Server, SQLite3 and Oracle, designed to be flexible, portable, and fun to use.

Leantime/leantime
580日前3.9k

Leantime is a goals focused project management system for non-project managers. Building with ADHD, Autism, and dyslexia in mind.

apache/flink
579日前22.8k

Apache Flink

cube-js/cube
579日前16.9k

📊 Cube — The Semantic Layer for Building Data Applications

dataliterate/data-populator
595日前1.7k

A plugin for Sketch and Adobe XD to populate your design mockups with meaningful data. Goodbye Lorem Ipsum. Hello JSON.

ClickHouse/ClickHouse
579日前33.1k

ClickHouse® is a free analytics DBMS for big data

mybatis/mybatis-3
579日前19.2k

MyBatis SQL mapper framework for Java

forthespada/CS-Books
579日前17.8k

🔥🔥超过1000本的计算机经典书籍、个人笔记资料以及本人在各平台发表文章中所涉及的资源等。书籍资源包括C/C++、Java、Python、Go语言、数据结构与算法、操作系统、后端架构、计算机系统知识、数据库、计算机网络、设计模式、前端、汇编以及校招社招各种面经~

drizzle-team/drizzle-orm
579日前17.3k

Headless TypeScript ORM with a head. Runs on Node, Bun and Deno. Lives on the Edge and yes, it's a JavaScript ORM too 😅

DapperLib/Dapper
579日前16.9k

Dapper - a simple object mapper for .Net

turbot/steampipe
579日前6.2k

Zero-ETL, infinite possibilities. Live query APIs, code & more with SQL. No DB required.

mendel5/alternative-front-ends
579日前5.9k

Overview of alternative open source front-ends for popular internet platforms (e.g. YouTube, Twitter, etc.)

mhinz/vim-galore
579日前16.2k

:mortar_board: All things Vim!

dexteryy/spellbook-of-modern-webdev
579日前16.6k

A Big Picture, Thesaurus, and Taxonomy of Modern JavaScript Web Development

dr5hn/countries-states-cities-database
580日前6.1k

🌍 Discover our global repository of countries, states, and cities! 🏙️ Get comprehensive data in JSON, SQL, PSQL, XML, YAML, and CSV formats. Access ISO2, ISO3 codes, country code, capital, native language, timezones (for countries), and more. #countries #states #cities

glideapps/glide-data-grid
580日前3.3k

🚀 Glide Data Grid is a no compromise, outrageously react fast data grid with rich rendering, first class accessibility, and full TypeScript support.

bolshchikov/js-must-watch
579日前13.1k

Must-watch videos about javascript

flyteorg/flyte
578日前4.5k

Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.

mahmoud/glom
589日前1.8k

☄️ Python's nested data operator (and CLI), for all your declarative restructuring needs. Got data? Glom it! ☄️

ygit/swiftui
583日前1.2k

A collaborative list of awesome SwiftUI resources. Feel free to contribute!

shekhargulati/52-technologies-in-2016
582日前7.2k

Let's learn a new technology every week. A new technology blog every Sunday in 2016.

dariubs/GoBooks
578日前15.6k

List of Golang books

mhadidg/software-architecture-books
578日前9.0k

A comprehensive list of books on Software Architecture.

duckdb/duckdb
578日前14.6k

DuckDB is an in-process SQL OLAP Database Management System

phanan/htaccess
579日前12.5k

✂A collection of useful .htaccess snippets.

Kanaries/graphic-walker
579日前2.1k

An open source alternative to Tableau. Easily embedded in any web apps.

rilldata/rill
578日前1.2k

Rill is a tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL. BI-as-code.

datalens-tech/datalens
580日前1.2k

A modern, scalable analytics system

ankane/blazer
579日前3.8k

Business intelligence made simple

evidence-dev/evidence
578日前2.9k

Business intelligence as code: build fast, interactive data visualizations in pure SQL and markdown..

shzlw/poli
585日前1.9k

An easy-to-use BI server built for SQL lovers. Power data analysis in SQL and gain faster business insights.

diffgram/diffgram
581日前1.8k

The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.

brimdata/zui
581日前1.7k

Zui is a powerful desktop application for exploring and working with data. The official front-end to the Zed lake.

PrefectHQ/prefect
578日前14.1k

Prefect is a workflow orchestration tool empowering developers to build, observe, and react to data pipelines

mage-ai/mage-ai
578日前6.5k

🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.

risingwavelabs/risingwave
578日前5.9k

Scalable Postgres for stream processing, analytics, and management. KsqlDB and Apache Flink alternative. 🚀 10x more productive. 🚀 10x more cost-efficient.

kestra-io/kestra
578日前5.4k

Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.

chartshq/muze
602日前1.2k

Composable data visualisation library for web with a data-first approach now powered by WebAssembly

slashbaseide/slashbase
579日前1.3k

Modern database IDE for your dev & data workflows. Supports MySQL, PostgreSQL & MongoDB.

cloudera/hue
579日前1.0k

Open source SQL Query Assistant service for Databases/Warehouses

jonschlinkert/gray-matter
579日前3.7k

Smarter YAML front matter parser, used by metalsmith, Gatsby, Netlify, Assemble, mapbox-gl, phenomic, vuejs vitepress, TinaCMS, Shopify Polaris, Ant Design, Astro, hashicorp, garden, slidev, saber, sourcegraph, and many others. Simple to use, and battle tested. Parses YAML by default but can also parse JSON Front Matter, Coffee Front Matter, TOML Front Matter, and has support for custom parsers. Please follow gray-matter's author: https://github.com/jonschlinkert

fnc12/sqlite_orm
579日前2.1k

❤️ SQLite ORM light header only library for modern C++

rdbende/Sun-Valley-ttk-theme
578日前1.6k

A gorgeous theme for Tkinter/ttk, based on the Sun Valley visual style ✨

launchbadge/sqlx
578日前11.2k

🧰 The Rust SQL Toolkit. An async, pure Rust SQL crate featuring compile-time checked queries without a DSL. Supports PostgreSQL, MySQL, and SQLite.

run-llama/llama_index
578日前28.2k

LlamaIndex (formerly GPT Index) is a data framework for your LLM applications

fivethirtyeight/data
579日前16.6k

Data and code behind the articles and graphics at FiveThirtyEight

prestodb/presto
578日前15.4k

The official home of the Presto distributed SQL query engine for big data

faker-js/faker
578日前11.4k

Generate massive amounts of fake data in the browser and node.js

pwxcoo/chinese-xinhua
578日前10.5k

:orange_book: 中华新华字典数据库。包括歇后语,成语,词语,汉字。

PRQL/prql
578日前9.1k

PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement

rawgraphs/rawgraphs-app
578日前8.5k

A web interface to create custom vector-based visualizations on top of RAWGraphs core

bchavez/Bogus
578日前8.1k

:card_index: A simple fake data generator for C#, F#, and VB.NET. Based on and ported from the famed faker.js.

mrdbourke/machine-learning-roadmap
578日前7.1k

A roadmap connecting many of the most important concepts in machine learning, how to learn them and what tools to use to perform them.

snowplow/snowplow
579日前6.7k

The enterprise-grade behavioral data engine (web, mobile, server-side, webhooks), running cloud-natively on AWS and GCP

cube2222/octosql
579日前4.7k

OctoSQL is a query tool that allows you to join, analyse and transform data from multiple databases and file formats using SQL.

whoiskatrin/sql-translator
579日前3.9k

SQL Translator is a tool for converting natural language queries into SQL code using artificial intelligence. This project is 100% free and open source.

roapi/roapi
579日前3.0k

Create full-fledged APIs for slowly moving datasets without writing a single line of code.

uber/aresdb
590日前3.0k

A GPU-powered real-time analytics storage and query engine.

adelsz/pgtyped
579日前2.7k

pgTyped - Typesafe SQL in TypeScript

kayak/pypika
578日前2.3k

PyPika is a python SQL query builder that exposes the full richness of the SQL language using a syntax that reflects the resulting query. PyPika excels at all sorts of SQL queries but is especially useful for data analysis.