Airbyte Provides Open Source Data Integration Platform for Data Lakes


Airbyte, creators of a fast-growing open-source data integration platform, is releasing an open-source data integration for data lakes, enabling AWS users to replicate data from anywhere to their Amazon Simple Storage Service (S3) account.

Companies are now able to leverage Airbyte’s 75-plus pre-built connectors, or build their own custom connectors within two hours using Airbyte’s Connector Development Kit (CDK), in order to replicate their data to S3.

It makes it possible for businesses to access all of their data consolidated in their data lake for analytics and any other use case, according to the vendor.

S3 is the first destination offered by Airbyte, but the data lakes of other cloud providers and Delta Lake will soon follow.

By commoditising data integration, Airbyte is establishing the new standard of moving and consolidating data from different sources to data warehouses, data lakes, or databases in a process referred to as extract, load, and, when desired, transform (ELT).

Businesses can create data pipelines from sources such as PostgreSQL, MySQL, Facebook Ads, Salesforce, and Stripe, and connect to destinations that include Redshift, Snowflake, and BigQuery.

Also Read: DMP, Data Lake and Data Warehouse – What is The Fuss All About?

To date, there are 75 connectors that Airbyte is certifying to ensure they are production ready. By the end of this year, the company anticipates it will reach 200 connectors, which would be the most pre-built connectors in the market. It recently introduced its Connector Development Kit (CDK) in order to enable its user community to accelerate development and quickly address the long tail of connectors.

The Airbyte connectors run in Docker containers, which means they can be deployed in minutes on any cloud platform. It also enables connectors to be built in any programming language.