Bases: object
Class which manages all the datasets in the PlayMolecule backend
session (Session object) – A Session object
Download a dataset from the backend
datasetid (int) – The ID of the dataset we want to download
path (str) – The location to which to download the dataset
tmpdir (str) – A location to store temporary data. If set to None, the default is /tmp/
attempts (int) – Number of times to attempt uploading the file. Can help with unstable connections
_logger (bool) – Set to False to reduce the verbosity
Examples
>>> dc.download_dataset(182, "./dataset_182/")
Returns tags associated with a dataset
Get a list of datasets filtered by various arguments
public (bool) – If set to True it will only return public datasets. If set to False it will only return private datasets. If set to None this parameter will be ignored.
datasetid (int) – The ID of a specific dataset for which we want to retrieve information
remotepath (str) – The remote (virtual) path at which the dataset is located
useronly (bool) – Returns only datasets of the currently logged in user of the Session
startswith (str) – Returns any datasets whose remote (virtual) path starts with the specific string
tags (list) – Returns only datasets which have the specified tags
taggedonly (bool) – If set to True it will only return datasets which have tags
completedonly (bool) – If set to True it will only return datasets whose jobs have completed successfully
group (str) – Only get datasets related to a job group
_logger (bool) – Set to False to reduce verbosity
datasetlist – A list of datasets retrieved with the above filters
Examples
>>> datasets = dc.get_datasets(remotepath="KdeepTrainer/models/PDBBind2019")
>>> datasets = dc.get_datasets(tags=["app:kdeep"])
Removes (deletes) a dataset from the backend
Removes a tag attached to a dataset
Uploads a dataset to the backend data center
localpath (str) – The location of the file we want to upload
remotepath (str) – The remote (virtual) location to which the file should be uploaded
comments (str) – Comments to attach to the specific dataset
public (bool) – Set to True to make the dataset public (available to all users)
execid (str) – Optionally you can relate this dataset to a specific job execution by passing it’s job ID
overwrite (bool) – Set to True to overwrite existing datasets at the specified remote (virtual) location
tags (list of str) – A list of tags to attach to the specific dataset
tmpdir (str) – Location to use for creating the upload archive file. The file will be deleted after uploading. If set to None it will use /tmp/
attempts (int) – Number of times to attempt uploading the file. Can help with unstable connections
_logger (bool) – Set to False to reduce the verbosity
Bases: object
Download the dataset from the backend
Examples
>>> ds.download("./dataset_data/")