DataForge Cloud compared to DataForge Core

Select DataForge Cloud for an end-to-end scalable cloud solution, allowing your teams to focus on building data products rather than orchestration and infrastructure.

DataForge Core: Functional Transforms

DataForge Core is a lightweight framework to build functional data transformation code

However, it has limitations that make it challenging to use at scale:

  • No extract/load functionality

  • No incremental refresh

  • No circular table dependencies

  • No support for schema evolution

  • Syntax checking at compile time only

  • Manual lineage analysis

  • SQL only

  • Requires integration with other tools

DataForge Cloud:
Declarative Data Management

DataForge Cloud extends the capabilities of Core to an end-to-end solution

  • Extract, Load, Transform, and Publish

  • Change data capture and incremental loads

  • Circular and self-referencing dependencies

  • Automatic schema evolution

  • Real-time syntax checking

  • Graphical lineage interface

  • SDK for Python, Scala, R, and SQL extensions

  • Native integrations with external tools

Cloud Features:

Save over 60% on cloud and infrastructure spend

DataForge Cloud uses the most cost-effective cloud services available for every stage in processing. Save 30-60%+ on Databricks cost by leveraging Jobs Compute and 70%+ on cloud spend by using spot pricing.

Manage thousands of systems with templates

DataForge Cloud templates provide an additional layer of code management to centrally control libraries of functional transforms. Use template cloning to integrate a new copy of an existing source system in one click.

Browser-based IDE and DataOps hub

Build, manage, configure, operate, monitor, and debug all in a single integrated interface. DataForge Cloud surfaces only the most important information to help teams optimize DataOps workflows.

Enterprise support and automated upgrades

Available in both full SaaS and private cloud deployment options, DataForge Cloud comes with robust support, SLA guarantees, auditing, logging, SSO, RBAC, training, and more!

DataForge Cloud

Dataforge Cloud is the fastest and most reliable way to deploy DataForge. Develop, orchestrate, operate, and audit functional code pipelines in an all-in-one web-based UI.

DataForge Core

DataForge Core is an open source command line tool that enables teams to write functional data transformation code following software engineering best practices and principles.