data

sql
data-science
react
data-analysis
python
data-engineering
javascript
csv
pipeline
data-visualization
query
database
visualization
data-integration
elt
etl
machine-learning
fetch
analytics
bi
json
workflow
orchestration
data-pipeline
data-collection
datasets
kubernetes
golang
hooks
cache
stale-while-revalidate
rest

TanStack/query
582 days ago · 38.5k

🤖 Powerful asynchronous state management, server-state utilities and data fetching for the web. TS/JS, React Query, Solid Query, Svelte Query and Vue Query.

metabase/metabase
580 days ago · 35.8k

The simplest, fastest way to get business intelligence and analytics to everyone in your company 😋

SheetJS/sheetjs
581 days ago · 34.2k

📗 SheetJS Spreadsheet Data Toolkit -- New home https://git.sheetjs.com/SheetJS/sheetjs

run-llama/llama_index
578 days ago · 28.2k

LlamaIndex (formerly GPT Index) is a data framework for your LLM applications

fivethirtyeight/data
579 days ago · 16.6k

Data and code behind the articles and graphics at FiveThirtyEight

prestodb/presto
578 days ago · 15.4k

The official home of the Presto distributed SQL query engine for big data

PrefectHQ/prefect
578 days ago · 14.1k

Prefect is a workflow orchestration tool empowering developers to build, observe, and react to data pipelines

airbytehq/airbyte
579 days ago · 13.2k

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

faker-js/faker
578 days ago · 11.4k

Generate massive amounts of fake data in the browser and node.js

pwxcoo/chinese-xinhua
578 days ago · 10.5k

📙 Chinese Xinhua Dictionary database, covering xiehouyu (two-part allegorical sayings), idioms, words, and Chinese characters.

Sinaptik-AI/pandas-ai
579 days ago · 9.8k

Chat with your data (SQL, CSV, pandas, polars, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.

PRQL/prql
578 days ago · 9.1k

PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement

rawgraphs/rawgraphs-app
578 days ago · 8.5k

A web interface to create custom vector-based visualizations on top of RAWGraphs core

bchavez/Bogus
578 days ago · 8.1k

📇 A simple fake data generator for C#, F#, and VB.NET. Based on and ported from the famed faker.js.

akfamily/akshare
579 days ago · 7.9k

AKShare is an elegant and simple open-source financial data interface library for Python, built for human beings!

mrdbourke/machine-learning-roadmap
578 days ago · 7.1k

A roadmap connecting many of the most important concepts in machine learning, how to learn them and what tools to use to perform them.

snowplow/snowplow
579 days ago · 6.7k

The enterprise-grade behavioral data engine (web, mobile, server-side, webhooks), running cloud-natively on AWS and GCP

DataEngineer-io/data-engineer-handbook
578 days ago · 6.6k

This is a repo with links to everything you'd ever want to learn about data engineering

mage-ai/mage-ai
578 days ago · 6.5k

🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.

cloudquery/cloudquery
580 days ago · 5.4k

The open source high performance data integration platform built for developers.

kestra-io/kestra
578 days ago · 5.4k

Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.

flyteorg/flyte
578 days ago · 4.5k

Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.

jonschlinkert/gray-matter
579 days ago · 3.7k

Smarter YAML front matter parser, used by metalsmith, Gatsby, Netlify, Assemble, mapbox-gl, phenomic, vuejs vitepress, TinaCMS, Shopify Polaris, Ant Design, Astro, hashicorp, garden, slidev, saber, sourcegraph, and many others. Simple to use, and battle tested. Parses YAML by default but can also parse JSON Front Matter, Coffee Front Matter, TOML Front Matter, and has support for custom parsers. Please follow gray-matter's author: https://github.com/jonschlinkert

heroku/react-refetch
585 days ago · 3.4k

A simple, declarative, and composable way to fetch data for React components

glideapps/glide-data-grid
580 days ago · 3.3k

🚀 Glide Data Grid is a no-compromise, outrageously fast React data grid with rich rendering, first-class accessibility, and full TypeScript support.

uber/aresdb
590 days ago · 3.0k

A GPU-powered real-time analytics storage and query engine.

kayak/pypika
578 days ago · 2.3k

PyPika is a Python SQL query builder that exposes the full richness of the SQL language using a syntax that reflects the resulting query. PyPika excels at all sorts of SQL queries but is especially useful for data analysis.

Kanaries/graphic-walker
579 days ago · 2.1k

An open source alternative to Tableau. Easily embedded in any web app.

keajs/kea
582 days ago · 1.9k

Batteries Included State Management for React

mahmoud/glom
589 days ago · 1.8k

☄️ Python's nested data operator (and CLI), for all your declarative restructuring needs. Got data? Glom it! ☄️

diffgram/diffgram
581 days ago · 1.8k

The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.

brimdata/zui
581 days ago · 1.7k

Zui is a powerful desktop application for exploring and working with data. The official front-end to the Zed lake.

JuliaData/DataFrames.jl
581 days ago · 1.7k

In-memory tabular data in Julia

dataliterate/data-populator
595 days ago · 1.7k

A plugin for Sketch and Adobe XD to populate your design mockups with meaningful data. Goodbye Lorem Ipsum. Hello JSON.

chartshq/muze
602 days ago · 1.2k

Composable data visualisation library for web with a data-first approach now powered by WebAssembly

rilldata/rill
578 days ago · 1.2k

Rill is a tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL. BI-as-code.