Command Line Interface Reference#

This page contains all commands available in the dlt CLI and is generated
automatically from the fully populated python argparse object of dlt.

Info

Flags and positional commands are inherited from the parent command. Position within the command string
is important. For example if you want to enable debug mode on the pipeline command, you need to add the
debug flag to the base dlt command:

dlt --debug pipeline

Adding the flag after the pipeline keyword will not work.

`dlt`#

Creates, adds, inspects and deploys dlt pipelines. Further help is available at https://dlthub.com/docs/reference/command-line-interface.

Usage

dlt [-h] [-v] [--non-interactive] [-y] [--debug] [--version]
    [--disable-telemetry] [--enable-telemetry] [--no-pwd]
    {telemetry,schema,pipeline,init,deploy,dashboard,ai} ...

Show Arguments and Options

Options

-h, --help - Show this help message and exit
-v, --verbose - Increase verbosity. repeat for more (-v, -vv, -vvv).
--non-interactive - Use prompt defaults; fail if a prompt has none. implied when stdin is not a tty.
-y, --yes - Auto-accept confirmations. free-form prompts still need defaults.
--debug - Show full stack traces on exceptions.
--version - Show program's version number and exit
--disable-telemetry - Disables telemetry before command is executed
--enable-telemetry - Enables telemetry before command is executed
--no-pwd - Do not add current working directory to sys.path. by default $pwd is added to reproduce python behavior when running scripts.

Available subcommands

telemetry - Shows telemetry status
schema - Shows, converts and upgrades schemas
pipeline - Inspects pipeline state, trace, load packages, provides basic maintenance
init - Creates a pipeline in the current folder by adding existing verified source or creating a new one from template.
deploy - Creates a deployment package for a selected pipeline script
dashboard - Shows the dlthub workspace dashboard
ai - Moved to dlthub ai (run pip install dlt[hub])

`dlt telemetry`#

Shows telemetry status.

Usage

dlt telemetry [-h]

Description

Shows the current status of dlt telemetry. Learn more about telemetry and what we send in our telemetry docs.

Show Arguments and Options

Inherits arguments from dlt.

Options

-h, --help - Show this help message and exit

`dlt schema`#

Shows, converts and upgrades schemas.

Usage

dlt schema [-h] [--format {json,yaml,dbml,dot,mermaid}] [--remove-defaults] file

Description

Loads, validates and prints out a dlt schema from a yaml or json file.

Show Arguments and Options

Inherits arguments from dlt.

Positional arguments

file - Schema file name, in yaml or json format, will autodetect based on extension

Options

-h, --help - Show this help message and exit
--format {json,yaml,dbml,dot,mermaid} - Display schema in this format
--remove-defaults - Does not show default hint values

`dlt pipeline`#

Inspects pipeline state, trace, load packages, provides basic maintenance.

Usage

dlt pipeline [-h] [--list-pipelines] [--pipelines-dir PIPELINES_DIR]
    [pipeline_name]
    {info,show,failed-jobs,drop-pending-packages,sync,trace,schema,drop,load-package,mcp}
    ...

Description

Provides tools to inspect the pipeline working directory, tables, and data in the destination, and to check for problems encountered during data loading.

Show Arguments and Options

Inherits arguments from dlt.

Positional arguments

pipeline_name - Pipeline name

Options

-h, --help - Show this help message and exit
--list-pipelines, -l - List local pipelines
--pipelines-dir PIPELINES_DIR - Pipelines working directory

Available subcommands

info - Displays state of the pipeline, use -v or -vv for more info
show - Generates and launches workspace dashboard with the loading status and dataset explorer
failed-jobs - Displays information on all the failed loads in all completed packages, failed jobs and associated error messages
drop-pending-packages - Deletes all extracted and normalized packages including those that are partially loaded.
sync - Drops the local state of the pipeline and resets all the schemas and restores it from destination. the destination state, data and schemas are left intact.
trace - Displays last run trace, use -v or -vv for more info
schema - Displays default schema
drop - Selectively drop tables and reset state
load-package - Displays information on load package, use -v or -vv for more info
mcp - Launch mcp server attached to this pipeline

`dlt pipeline info`#

Displays state of the pipeline, use -v or -vv for more info.

Usage

dlt pipeline [pipeline_name] info [-h]

Description

Displays the content of the working directory of the pipeline: dataset name, destination, list of
schemas, resources in schemas, list of completed and normalized load packages, and optionally a
pipeline state set by the resources during the extraction process.

Show Arguments and Options

Inherits arguments from dlt pipeline.

Options

-h, --help - Show this help message and exit

`dlt pipeline show`#

Generates and launches workspace dashboard with the loading status and dataset explorer.

Usage

dlt pipeline [pipeline_name] show [-h] [--edit]

Description

Launches the workspace dashboard with a comprehensive interface to inspect the pipeline state, schemas, and data in the destination.

This dashboard should be executed from the same folder from which you ran the pipeline script to be able access destination credentials.

If the --edit flag is used, will launch the editable version of the dashboard if it exists in the current directory, or create this version and launch it in edit mode.

Requires marimo to be installed in the current environment: pip install marimo.

Show Arguments and Options

Inherits arguments from dlt pipeline.

Options

-h, --help - Show this help message and exit
--edit - Creates editable version of workspace dashboard in current directory if it does not exist there yet and launches it in edit mode.

`dlt pipeline failed-jobs`#

Displays information on all the failed loads in all completed packages, failed jobs and associated error messages.

Usage

dlt pipeline [pipeline_name] failed-jobs [-h]

Description

This command scans all the load packages looking for failed jobs and then displays information on
files that got loaded and the failure message from the destination.

Show Arguments and Options

Inherits arguments from dlt pipeline.

Options

-h, --help - Show this help message and exit

`dlt pipeline drop-pending-packages`#

Deletes all extracted and normalized packages including those that are partially loaded.

Usage

dlt pipeline [pipeline_name] drop-pending-packages [-h]

Description

Removes all extracted and normalized packages in the pipeline's working dir.
dlt keeps extracted and normalized load packages in the pipeline working directory. When the run method is called, it will attempt to normalize and load
pending packages first. This command removes such packages. Note that pipeline state is not reverted to the state at which the deleted packages
were created. Using the sync sub-command is recommended if your destination supports state sync.

Show Arguments and Options

Inherits arguments from dlt pipeline.

Options

-h, --help - Show this help message and exit

`dlt pipeline sync`#

Drops the local state of the pipeline and resets all the schemas and restores it from destination. The destination state, data and schemas are left intact.

Usage

dlt pipeline [pipeline_name] sync [-h] [--destination DESTINATION]
    [--dataset-name DATASET_NAME]

Description

This command will remove the pipeline working directory with all pending packages, not synchronized
state changes, and schemas and retrieve the last synchronized data from the destination. If you drop
the dataset the pipeline is loading to, this command results in a complete reset of the pipeline state.

In case of a pipeline without a working directory, this command may be used to create one from the
destination. In order to do that, you need to pass the dataset name and destination name to the CLI
and provide the credentials to connect to the destination (i.e., in .dlt/secrets.toml) placed in the
folder where you run it.

Show Arguments and Options

Inherits arguments from dlt pipeline.

Options

-h, --help - Show this help message and exit
--destination DESTINATION - Sync from this destination when local pipeline state is missing.
--dataset-name DATASET_NAME - Dataset name to sync from when local pipeline state is missing.

`dlt pipeline trace`#

Displays last run trace, use -v or -vv for more info.

Usage

dlt pipeline [pipeline_name] trace [-h]

Description

Displays the trace of the last pipeline run containing the start date of the run, elapsed time, and the
same information for all the steps (extract, normalize, and load). If any of the steps failed,
you'll see the message of the exceptions that caused that problem. Successful load and run steps
will display the load info instead.

Show Arguments and Options

Inherits arguments from dlt pipeline.

Options

-h, --help - Show this help message and exit

`dlt pipeline schema`#

Displays default schema.

Usage

dlt pipeline [pipeline_name] schema [-h] [--format {json,yaml,dbml,dot,mermaid}]
    [--remove-defaults]

Description

Displays the default schema for the selected pipeline.

Show Arguments and Options

Inherits arguments from dlt pipeline.

Options

-h, --help - Show this help message and exit
--format {json,yaml,dbml,dot,mermaid} - Display schema in this format
--remove-defaults - Does not show default hint values

`dlt pipeline drop`#

Selectively drop tables and reset state.

Usage

dlt pipeline [pipeline_name] drop [-h] [--destination DESTINATION]
    [--dataset-name DATASET_NAME] [--drop-all] [--state-paths [STATE_PATHS ...]]
    [--schema SCHEMA_NAME] [--state-only] [resources ...]

Description

Selectively drop tables and reset state.

dlt pipeline <pipeline name> drop [resource_1] [resource_2]

Drops tables generated by selected resources and resets the state associated with them. Mainly used
to force a full refresh on selected tables. In the example below, we drop all tables generated by
the repo_events resource in the GitHub pipeline:

dlt pipeline github_events drop repo_events

dlt will inform you of the names of dropped tables and the resource state slots that will be
reset:

About to drop the following data in dataset airflow_events_1 in destination dlt.destinations.duckdb:
Selected schema:: github_repo_events
Selected resource(s):: ['repo_events']
Table(s) to drop:: ['issues_event', 'fork_event', 'pull_request_event', 'pull_request_review_event', 'pull_request_review_comment_event', 'watch_event', 'issue_comment_event', 'push_event__payload__commits', 'push_event']
Resource(s) state to reset:: ['repo_events']
Source state path(s) to reset:: []
Do you want to apply these changes? [y/N]

As a result of the command above the following will happen:

All the indicated tables will be dropped in the destination. Note that dlt drops the nested
tables as well.
All the indicated tables will be removed from the indicated schema.
The state for the resource repo_events was found and will be reset.
New schema and state will be stored in the destination.

The drop command accepts several advanced settings:

You can use regexes to select resources. Prepend the re: string to indicate a regex pattern. The example
below will select all resources starting with repo:

dlt pipeline github_events drop "re:^repo"

You can drop all tables in the indicated schema:

dlt pipeline chess drop --drop-all

You can indicate additional state slots to reset by passing JsonPath to the source state. In the example
below, we reset the archives slot in the source state:

dlt pipeline chess_pipeline drop --state-paths archives

This will select the archives key in the chess source.

{
  "sources":{
    "chess": {
      "archives": [
        "https://api.chess.com/pub/player/magnuscarlsen/games/2022/05"
      ]
    }
  }
}

This command is still experimental and the interface will most probably change.

Show Arguments and Options

Inherits arguments from dlt pipeline.

Positional arguments

resources - One or more resources to drop. can be exact resource name(s) or regex pattern(s). regex patterns must start with re:

Options

-h, --help - Show this help message and exit
--destination DESTINATION - Sync from this destination when local pipeline state is missing.
--dataset-name DATASET_NAME - Dataset name to sync from when local pipeline state is missing.
--drop-all - Drop all resources found in schema. supersedes [resources] argument.
--state-paths [STATE_PATHS ...] - State keys or json paths to drop
--schema SCHEMA_NAME - Schema name to drop from (if other than default schema).
--state-only - Only wipe state for matching resources without dropping tables.

`dlt pipeline load-package`#

Displays information on load package, use -v or -vv for more info.

Usage

dlt pipeline [pipeline_name] load-package [-h] [load-id]

Description

Shows information on a load package with a given load_id. The load_id parameter defaults to the
most recent package. Package information includes its state (COMPLETED/PROCESSED) and list of all
jobs in a package with their statuses, file sizes, types, and in case of failed jobs—the error
messages from the destination. With the verbose flag set (-v), you can also see the
list of all tables and columns created at the destination during the loading of that package.

Show Arguments and Options

Inherits arguments from dlt pipeline.

Positional arguments

load-id - Load id of completed or normalized package. defaults to the most recent package.

Options

-h, --help - Show this help message and exit

`dlt pipeline mcp`#

Launch MCP server attached to this pipeline.

Usage

dlt pipeline [pipeline_name] mcp [-h] [--stdio] [--sse] [--port PORT]
    [--features [FEATURES ...]]

Description

This MCP facilitates schema and data exploration for the dataset created with this pipeline.

Show Arguments and Options

Inherits arguments from dlt pipeline.

Options

-h, --help - Show this help message and exit
--stdio - Use stdio transport mode
--sse - Use legacy sse transport instead of streamable-http
--port PORT - Port for the mcp server (default: 43656)
--features [FEATURES ...] - Mcp features to enable/disable. default: context, pipeline, secrets, toolkit, workspace. use +name to add, -name to remove (e.g. --features=-secrets,+context)

`dlt init`#

Creates a pipeline in the current folder by adding existing verified source or creating a new one from template.

Usage

dlt init [-h] [--list-sources] [--list-destinations] [--location LOCATION]
    [--branch BRANCH] [--eject] [source] [destination]

Description

This command creates a new dlt pipeline script that loads data from source to destination. When you run the command, several things happen:

Creates a basic project structure if the current folder is empty by adding .dlt/config.toml, .dlt/secrets.toml, and .gitignore files.
Checks if the source argument matches one of our verified sources and, if so, adds it to your project.
If the source is unknown, uses a generic template to get you started.
Rewrites the pipeline scripts to use your destination.
Creates sample config and credentials in secrets.toml and config.toml for the specified source and destination.
Creates requirements.txt with dependencies required by the source and destination. If one exists, prints instructions on what to add to it.

This command can be used several times in the same folder to add more sources, destinations, and pipelines. It will also update the verified source code to the newest
version if run again with an existing source name. You will be warned if files will be overwritten or if the dlt version needs an upgrade to run a particular pipeline.

Show Arguments and Options

Inherits arguments from dlt.

Positional arguments

source - Name of data source for which to create a pipeline. adds existing verified source or creates a new pipeline template if verified source for your data source is not yet implemented.
destination - Name of a destination i.e. bigquery or redshift

Options

-h, --help - Show this help message and exit
--list-sources, -l - Shows all available verified sources and their short descriptions. for each source, it checks if your local dlt version requires an update and prints the relevant warning.
--list-destinations - Shows the name of all core dlt destinations.
--location LOCATION - Advanced. uses a specific url or local path to verified sources repository.
--branch BRANCH - Advanced. uses specific branch of the verified sources repository to fetch the template.
--eject - Ejects the source code of the core source like sql_database or rest_api so they will be editable by you.

`dlt deploy`#

Creates a deployment package for a selected pipeline script.

Usage

dlt deploy [-h] pipeline-script-path {github-action,airflow-composer} ...

Description

Prepares your pipeline for deployment and gives you step-by-step instructions on how to accomplish it. To enable this functionality, please first execute pip install "dlt[cli]" which adds additional packages to the current environment.

Show Arguments and Options

Inherits arguments from dlt.

Positional arguments

pipeline-script-path - Path to a pipeline script

Options

-h, --help - Show this help message and exit

Available subcommands

github-action - Deploys the pipeline to github actions
airflow-composer - Deploys the pipeline to airflow

`dlt deploy github-action`#

Deploys the pipeline to Github Actions.

Usage

dlt deploy pipeline-script-path github-action [-h] [--location LOCATION]
    [--branch BRANCH] --schedule SCHEDULE [--run-manually] [--run-on-push]

Description

Deploys the pipeline to GitHub Actions.

GitHub Actions (https://github.com/features/actions) is a CI/CD runner with a large free tier which you can use to run your pipelines.

You must specify when the GitHub Action should run using a cron schedule expression. The command also takes additional flags:
--run-on-push (default is False) and --run-manually (default is True). Remember to put the cron
schedule expression in quotation marks.

For the chess.com API example from our docs, you can deploy it with dlt deploy chess.py github-action --schedule "*/30 * * * *".

Follow the guide on how to deploy a pipeline with GitHub Actions in our documentation for more information.

Show Arguments and Options

Inherits arguments from dlt deploy.

Options

-h, --help - Show this help message and exit
--location LOCATION - Advanced. uses a specific url or local path to pipelines repository.
--branch BRANCH - Advanced. uses specific branch of the deploy repository to fetch the template.
--schedule SCHEDULE - A schedule with which to run the pipeline, in cron format. example: '*/30 * * * *' will run the pipeline every 30 minutes. remember to enclose the scheduler expression in quotation marks!
--run-manually - Allows the pipeline to be run manually form github actions ui.
--run-on-push - Runs the pipeline with every push to the repository.

`dlt deploy airflow-composer`#

Deploys the pipeline to Airflow.

Usage

dlt deploy pipeline-script-path airflow-composer [-h] [--location LOCATION]
    [--branch BRANCH] [--secrets-format {env,toml}]

Description

Google Composer (https://cloud.google.com/composer?hl=en) is a managed Airflow environment provided by Google. Follow the guide in our docs on how to deploy a pipeline with Airflow to learn more. This command will:

create an Airflow DAG for your pipeline script that you can customize. The DAG uses
the dlt Airflow wrapper (https://github.com/dlt-hub/dlt/blob/devel/dlt/helpers/airflow_helper.py#L37) to make this process trivial.
provide you with the environment variables and secrets that you must add to Airflow.
provide you with a cloudbuild file to sync your GitHub repository with the dag folder of your Airflow Composer instance.

Show Arguments and Options

Inherits arguments from dlt deploy.

Options

-h, --help - Show this help message and exit
--location LOCATION - Advanced. uses a specific url or local path to pipelines repository.
--branch BRANCH - Advanced. uses specific branch of the deploy repository to fetch the template.
--secrets-format {env,toml} - Format of the secrets

`dlt dashboard`#

Shows the dlthub workspace dashboard.

Usage

dlt dashboard [-h] [--pipelines-dir PIPELINES_DIR] [--edit]

Description

This command shows the dlt workspace dashboard. You can use the dashboard:

to list and inspect local pipelines
browse the full pipeline schema and all hints
browse the data in the destination
inspect the pipeline state.

Show Arguments and Options

Inherits arguments from dlt.

Options

-h, --help - Show this help message and exit
--pipelines-dir PIPELINES_DIR - Pipelines working directory
--edit - Eject dashboard and start editable version

`dlt ai`#

Moved to dlthub ai (run pip install dlt[hub]).

Usage

dlt ai [-h]

Description

ai command moved to dlthub, pip install dlt[hub] and dlthub ai to use.

Show Arguments and Options

Inherits arguments from dlt.

Options

-h, --help - Show this help message and exit

Command Line Interface Reference#

dlt#

dlt telemetry#

dlt schema#

dlt pipeline#

dlt pipeline info#

dlt pipeline show#

dlt pipeline failed-jobs#

dlt pipeline drop-pending-packages#

dlt pipeline sync#

dlt pipeline trace#

dlt pipeline schema#

dlt pipeline drop#

dlt pipeline load-package#

dlt pipeline mcp#

dlt init#

dlt deploy#

dlt deploy github-action#

dlt deploy airflow-composer#

dlt dashboard#

dlt ai#

`dlt`#

`dlt telemetry`#

`dlt schema`#

`dlt pipeline`#

`dlt pipeline info`#

`dlt pipeline show`#

`dlt pipeline failed-jobs`#

`dlt pipeline drop-pending-packages`#

`dlt pipeline sync`#

`dlt pipeline trace`#

`dlt pipeline schema`#

`dlt pipeline drop`#

`dlt pipeline load-package`#

`dlt pipeline mcp`#

`dlt init`#

`dlt deploy`#

`dlt deploy github-action`#

`dlt deploy airflow-composer`#

`dlt dashboard`#

`dlt ai`#