The future is functional
DataForge is the first framework to enable purely functional data transformation code. Avoid pitfalls associated with procedural pipelines and apply software engineering best practices such as DRY, composability, and extensibility. With DataForge’s declarative approach, you can focus on what to build, not how.
The next generation of data transformation
No more boiler plate or
copy-paste
Develop Faster
Replace verbose SQL statements and DDL with functional snippets that automate dependencies, manage tables/views, and check for duplicate statements. Use the DataForge Cloud IDE to leverage auto-complete, real-time syntax checking, and templates to define transformations quickly with less code.
Add logic without editing existing pipelines
Extend with ease
Leave production code in place by adding functions rather than editing existing code. Add new logic at any point in the chain without the need for detailed code analysis or regression testing.
Code adapts automatically with data changes
Automate evolution
Schema evolution and metadata services provided by DataForge Cloud generate and adjust code real-time when columns are added or changed in source systems. No need to update pipelines or backfill missing elements.
Consistent patterns and standard designs
Simplify governance
Functional code is inherently more consistently designed than procedural scripts. Leverage CI/CD integrations and the DataForge Cloud observability database to quickly and easily review and query code snippets to check for inconsistent patterns and practices.
DataForge Cloud
Dataforge Cloud is the fastest and most reliable way to deploy DataForge. Develop, orchestrate, operate, and audit functional code pipelines in an all-in-one web-based UI.
DataForge Core
DataForge Core is an open source command line tool that enables teams to write functional data transformation code following software engineering best practices and principles.