data-lineage contains two types of SDKs:
- API - connects to the API server.
- DB - connects directly to the PostgreSQL database.
The API SDK is the preferred choice; use it by default.
Both variants are functionally equivalent.
The DB SDK is the better option when bulk loading catalog and lineage data into the database. However, the programmer has to manage the lifecycle of database sessions and transactions, so the DB SDK is best suited for power users.
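To give a feel for what "managing the lifecycle of database sessions and transactions" means with the DB SDK, here is a minimal, generic sketch. It does not use the data-lineage API. It assumes Python's built-in `sqlite3` as a stand-in for PostgreSQL, and the `session`, `transaction`, and `bulk_load` helpers and the `catalog_tables` table are hypothetical names chosen for illustration only.

```python
import sqlite3
from contextlib import contextmanager


@contextmanager
def session(db_path):
    """Open a connection and guarantee it is closed when the block exits."""
    conn = sqlite3.connect(db_path)  # stand-in for a PostgreSQL connection
    try:
        yield conn
    finally:
        conn.close()


@contextmanager
def transaction(conn):
    """Commit if the block succeeds; roll back and re-raise if it fails."""
    try:
        yield conn
        conn.commit()
    except Exception:
        conn.rollback()
        raise


def bulk_load(conn, rows):
    """Insert many rows in one transaction so the load is all-or-nothing."""
    with transaction(conn):
        conn.executemany(
            # Hypothetical table, not the data-lineage schema.
            "INSERT INTO catalog_tables (schema_name, table_name) VALUES (?, ?)",
            rows,
        )


if __name__ == "__main__":
    with session(":memory:") as conn:
        conn.execute(
            "CREATE TABLE catalog_tables ("
            "schema_name TEXT NOT NULL, table_name TEXT NOT NULL)"
        )
        bulk_load(conn, [("public", "orders"), ("public", "customers")])
        count = conn.execute("SELECT COUNT(*) FROM catalog_tables").fetchone()[0]
        print(f"loaded {count} rows")
```

With the API SDK, this bookkeeping happens on the server side, which is one reason it is the simpler default.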
Using Jupyter Notebooks is optional.
# Install packages
pip install data-lineage
pip install jupyter
# Check out the example notebook: http://tokern.io/docs/data-lineage/example/