Firebolt Cloud Data Warehouse

Finally, a cloud data warehouse that can handle the scale you need with the performance you always wanted.

Firebolt was designed to deliver the combined improvements in speed, scale, and efficiency needed to cost-effectively deliver fast operational and customer-facing analytics.

Firebolt fits into your modern data stack, and was designed to help you ship production-grade data apps quickly and reliably. Orchestrate everything programmatically end-to-end through Firebolt’s APIs and SDKs, and even enjoy a web IDE you’ll actually want to keep using.

Firebolt’s modern cloud data warehouse was built for AWS, and leverages best of breed infrastructure components to deliver best price-performance for analytics.

Modern Architecture

Firebolt was architected from the ground up to leverage the benefits of the cloud, so you can build robust and scalable analytics experiences that will last.

Firebolt is a columnar Data Warehouse built for low-latency analytics workloads at TB++ scale with built-in storage optimization. As a SaaS offering, there are no server or software upgrades to manage, lowering operational overhead and costs.

Figure 1. Firebolt decoupled storage-compute architecture

Decoupled Storage and Compute

The promise of endless compute and unlimited storage in the cloud sounds great. But when delivering fast analytics, this could require huge clusters with expensive block storage and it becomes unaffordable for most. Firebolt is not just fast, but also introduces new levels of hardware efficiency. This means that you can deliver sub-second TB++ scale analytics using small and affordable compute clusters.

Firebolt’s decoupled storage and compute architecture allows you to easily scale as your data challenges grow, without the traditional complexities. Firebolt’s unique “F3” file format is optimized for sub-second performance while reducing capacity and cost through the use of columnar compression on object storage. Additionally, granular compute and scale-up/scale-out/scale-in/scale-to-zero of Firebolt engines provides the ultimate control in data warehouse infrastructure provisioning.

With decoupled storage and compute, Firebolt makes it easy to spin up dev/test environments or scale your production workloads, all the while maintaining workload and resource isolation. Accessing data or Ingesting data from object storage is as simple as “insert into table select * from object_storage”.

Figure 2. Firebolt Decoupled compute and storage

Sub-second Analytics

As customers build data warehouse based data apps, the need for sub-second performance and high number of queries per second are challenging. Most analytics platforms queue requests due to concurrency limitations or use auto scaling to add clusters or worker nodes to increase concurrency and QPS. Firebolt takes a different approach by focusing on intelligently pruning data retrieved and scanned, Firebolt is able to consume less resources, deliver sub-second analytics and high concurrency at the same time.

Firebolt leverages various types of indexes to deliver fast, high concurrency analytics. Additionally, customers have the option of adding multiple engines, each with scale-out capabilities, to increase concurrency, resource and workload isolation.

Building Next Generation Data Apps

Data Apps extend analytics beyond traditional Business Intelligence and embedded analytics. Engineering and Developments teams collaborate to create Data Apps. Firebolt is designed for Data Apps with the following capabilities.

Leverage decoupled compute and storage to not only scale but provide workload isolation as well. This ensures quality of service.
Ability to spin-up / spin-down environments rapidly using SQL, enhancing developer productivity.
Low latency and High concurrency are must have attributes for data apps. Firebolt achieves this with efficiently sized infrastructure.
ANSI-SQL and strong database fundamentals to simplify development of Data Apps.

Data Warehousing Costs

At Firebolt, we believe that modern data engineering & dev teams value the ability to control what they’re running. This is why Firebolt lets you be very granular in your compute resource choices, letting you choose different CPU, SSD, and RAM combinations. This will allow you to optimize your resources for the best price-performance ratio, without over-spending on under-utilized resources. Performance optimization gets the most out of your object storage with no additional uplift. Firebolt’s pricing model is simple, transparent without any tier uplifts.

Query Optimization

Firebolt Query optimizer parses each query, builds a logical query plan, and then applies plan transformations and rewrites to derive the best physical query plan. Firebolt’s use of sparse indexing combined with F3 file format reduces the amount of data fetched over the network by over 10X or more enabling sub-second analytics.

Further, the query optimizer evaluates whether query performance can be improved by reordering query operations, or by using indexes in place of operations, reordering operations to take advantage of predicate pushdowns, and uses aggregating and join indexes to further reduce both data access and scans. This helps reduce data sets before accessing data or performing joins boosting performance with both star and snowflake schemas.

Figure 4. Query Optimization in Firebolt

Fast Indexing

Firebolt uses indexing extensively to improve performance. Each table has a sparse (primary) index consisting of any number of columns in any order. In addition, there are indexes that help with aggregations or joins. Indexes dramatically improve performance and overcome some of the limitations of decoupled storage and compute. They are also updated and optimized during ingestion eliminating the need for additional management.

This optimization of storage and compute with indexing reduces both remote data access and the amount of data cached by 10x or more, which also leads to 10x less data to scan and process.

Figure 5. Specialized indexes in Firebolt

Semi Structured / JSON

JSON feels at home in Firebolt. No need to flatten your semi-structured data in lengthy ETL processes - Just ingest it natively into Firebolt. You can query it directly with SQL and traditional UNNEST, or enjoy even more flexibility and efficiency without unnesting and using our Lambda-like syntax and array manipulation options.

SQL and Ecosystem Integration

Firebolt speaks ANSI-SQL, and sports a SQL IDE that you’ll actually enjoy using. We won’t discourage you from using joins and won’t encourage you to denormalize your data - it’s easy to remain fast with Firebolt.

Additionally, Firebolt partners with modern data stack solutions that harness Firebolt’s speed, scale and efficiency to extend what customers can achieve using complementary integrations and capabilities.

ecosystem integration diagram — Figure 6. Ecosystem integrations diagram

Firebolt Performance Advantage

Results below illustrate Firebolt performance gains with smaller and cheaper clusters Vs Snowflake customer. Actual customer results with a 0.5TB dataset.

Our Customers Say

Firebolt immediately gave us faster performance at a much greater scale, which let our customers analyze huge datasets with sub-second performance.
It also gave us the flexibility to deliver complex data features at a much faster pace

Yoav Shmaria
VP R&D, Platform.
SimilarWeb

Firebolt has outperformed all other data serving SQL engines we have tested. It is built by engineers - for engineers

Roy Miara
Engineering Manager
Explorium

With Firebolt, our 1000 Looker users can now run any analytics against billions of rows and terabytes of data, in seconds or less.

Alexandra Sudilovski
S. BI Expert & Looker Guild M.
AppsFlyer

We improved performance by using Firebolt, and our reporting became nearly real-time - below one-hour latency in reports.

Nayan Mevada
‍CTO
Infy.tv

A feature-set that gets the job done. Fast.

Sub-second, High Concurrency Performance

Provides sub-second latency at TB++ scale and enables high concurrency workloads

Cloud-native Architecture

Leverages Cloud Elasticity and Scale while providing granular provisioning.

Decoupled Compute & Storage

Data warehouse data is stored separately from compute. This enables spinning up multiple compute engines to process the data. For example, two competing workloads can be run on separate engines while accessing the same data on optimized object storage.

Specialized Indexes for Performance

Indexes enable effective pruning of data for access, joins and aggregations. Index maintenance is abstracted away from the administrator.

SaaS

Software as a Service eliminates server, software maintenance. Customers can focus on bringing and using their data immediately, reducing time to value significantly.

Transparent pricing

Simplified pricing based on compute and object storage consumption. No tier upgrades or unpredictable add-ons.

Integration with Modern Data Stack

Firebolt extends its functionality through ecosystem partners such as Airflow, DBT, Tableau, Looker and others.

Speaks ANSI SQL

Firebolt leverages ANSI-SQL to support SQL functionality.

Hit us with your most challenging use case

‍

The Fast Cloud Data Warehouse for Analytics