Airbyte
Airbyte is a data integration platform for ELT pipelines from APIs, databases & files to warehouses & lakes. It has the largest catalog of ELT connectors to data warehouses and databases.
Installation and Setupβ
This instruction shows how to load any source from Airbyte
into a local JSON
file that can be read in as a document.
Prerequisites:
Have docker desktop
installed.
Steps:
- Clone Airbyte from GitHub -
git clone https://github.com/airbytehq/airbyte.git
. - Switch into Airbyte directory -
cd airbyte
. - Start Airbyte -
docker compose up
. - In your browser, just visit http://localhost:8000. You will be asked for a username and password. By default, that's username
airbyte
and passwordpassword
. - Setup any source you wish.
- Set destination as Local JSON, with specified destination path - lets say
/json_data
. Set up a manual sync. - Run the connection.
- To see what files are created, navigate to:
file:///tmp/airbyte_local/
.
Document Loaderβ
See a usage example.
from langchain.document_loaders import AirbyteJSONLoader