Show HN: Xorq – open-source Python-first Pandas-style pipelines

Hi HN, Dan, Hussain and Daniel here… After years of struggling with data pipelines that worked in notebooks but failed in production, we decided to do something about it. We created xorq to eliminate the constant headaches of SQL/pandas impedance mismatch, runtime debugging, wasteful recomputations and unreliable research-to-production deployments that plague traditional pandas-style pipeline workflows. xorq is built on Ibis and DataFusion.We’d love your feedback and contributions. xorq is [Apache 2.0 licensed](https://github.com/letsql/xorq/blob/main/LICENSE) to encourage open collaboration.Repo: https://github.com/letsql/xorqDocs: https://docs.xorq.devRoadmap Issues: https://github.com/letsql/xorqYou can get started `pip install xorq`.Or, if you use nix, you can simply run `nix run github:xorq-labs/xorq` and drop into an IPython shell.Demo video: https://youtu.be/jUk8vrR6bCwHere are some vignettes to look into next:1. MCP Server + Flight + XGBoost: https://docs.xorq.dev/vignettes/mcp_flight_server2. 1 DuckDB + 2 Writers + 1 Reader: https://docs.xorq.dev/vignettes/duckdb_concurrent3. OpenAI UDF: https://docs.xorq.dev/tutorials/hn_data_prepSome features to note:- Ibis-based multi-engine expression system: effortless engine-to-engine streaming- Cache expressions with `.cache` operator- Portable DataFusion-backed UDF engine with first class support for pandas dataframes- Serialize Expressions to and from YAML- Easily build Flight end-points by composing UDFsthanks for checking this out, and we’re here to answer any questions! Comments URL: https://news.ycombinator.com/item?id=43495811 Points: 8 # Comments: 0

Mar 27, 2025 - 21:04
 0
Show HN: Xorq – open-source Python-first Pandas-style pipelines

Hi HN, Dan, Hussain and Daniel here… After years of struggling with data pipelines that worked in notebooks but failed in production, we decided to do something about it. We created xorq to eliminate the constant headaches of SQL/pandas impedance mismatch, runtime debugging, wasteful recomputations and unreliable research-to-production deployments that plague traditional pandas-style pipeline workflows. xorq is built on Ibis and DataFusion.

We’d love your feedback and contributions. xorq is [Apache 2.0 licensed](https://github.com/letsql/xorq/blob/main/LICENSE) to encourage open collaboration.

Repo: https://github.com/letsql/xorq

Docs: https://docs.xorq.dev

Roadmap Issues: https://github.com/letsql/xorq

You can get started `pip install xorq`.

Or, if you use nix, you can simply run `nix run github:xorq-labs/xorq` and drop into an IPython shell.

Demo video: https://youtu.be/jUk8vrR6bCw

Here are some vignettes to look into next:

1. MCP Server + Flight + XGBoost: https://docs.xorq.dev/vignettes/mcp_flight_server

2. 1 DuckDB + 2 Writers + 1 Reader: https://docs.xorq.dev/vignettes/duckdb_concurrent

3. OpenAI UDF: https://docs.xorq.dev/tutorials/hn_data_prep

Some features to note:

- Ibis-based multi-engine expression system: effortless engine-to-engine streaming

- Cache expressions with `.cache` operator

- Portable DataFusion-backed UDF engine with first class support for pandas dataframes

- Serialize Expressions to and from YAML

- Easily build Flight end-points by composing UDFs

thanks for checking this out, and we’re here to answer any questions!


Comments URL: https://news.ycombinator.com/item?id=43495811

Points: 8

# Comments: 0