mantik.data_repository.data_repository

Attributes

logger

Functions

data_download(→ str)

Downloads a GIT based data repository, along with its DVC files.

download_data_repository(→ str)

Downloads a GIT based data repository, along with its DVC files.

dvc_pull_with_aws_credentials(aws_access_key_id, ...)

Perform DVC pull using S3 as a DVC backend

git_clone_with_checkout(git_uri, checkout, target_dir)

Make target folder, git clone, and checkout a specific commit.

verify_dvc_is_installed()

Module Contents

mantik.data_repository.data_repository.data_download(project_id: uuid.UUID | None = None, data_repository_id: uuid.UUID | None = None, branch: str | None = None, commit: str | None = None, target_dir: pathlib.Path | None = None, mantik_access_token: str | None = None) str[source]

Downloads a GIT based data repository, along with its DVC files.

Is meant to be used in the training script. import mantik mantik.data_download()

mantik.data_repository.data_repository.download_data_repository(project_id: uuid.UUID, data_repository_id: uuid.UUID, checkout: str | None, target_dir: pathlib.Path, token: str) str[source]

Downloads a GIT based data repository, along with its DVC files.

mantik.data_repository.data_repository.dvc_pull_with_aws_credentials(aws_access_key_id: str, aws_secret_access_key: str, target_dir: pathlib.Path | None = None)[source]

Perform DVC pull using S3 as a DVC backend

mantik.data_repository.data_repository.git_clone_with_checkout(git_uri: str, checkout: str | None, target_dir: pathlib.Path)[source]

Make target folder, git clone, and checkout a specific commit.

mantik.data_repository.data_repository.logger[source]
mantik.data_repository.data_repository.verify_dvc_is_installed()[source]