geowatch.utils.util_data module

Todo

  • [ ] make a nicer DVC registry API and CLI

  • [ ] rename

SeeAlso:

../cli/find_dvc.py python -m geowatch find_dvc list –hardware=ssd –tags=phase2_data python -m geowatch find_dvc list –hardware=hdd –tags=phase2_data python -m geowatch find_dvc list –hardware=auto –tags=phase2_data

class geowatch.utils.util_data.DataRegistry(registry_fpath=None)[source]

Bases: object

Provide a quick way of storing and querying for machine specific paths

pandas(**kwargs)[source]
list(**kwargs)[source]
add(name, path, **kwargs)[source]
set(name, path=None, **kwargs)[source]

Set an attribute of a row

remove(name)[source]
read()[source]
query(must_exist=False, **kwargs)[source]
find(on_error='raise', envvar=None, **kwargs)[source]
geowatch.utils.util_data.find_dvc_dpath(name=NoParam, on_error='raise', **kwargs)[source]

Return the location of the GeoWATCH DVC Data path if it exists and is in a “standard” location.

NOTE: other team members can add their “standard” locations if they want.

SeeAlso:

GEOWATCH_DATA_DPATH=$(geowatch_dvc)

python -m geowatch.cli.find_dvc –hardware=hdd python -m geowatch.cli.find_dvc –hardware=ssd

geowatch.utils.util_data.find_smart_dvc_dpath(*args, **kw)[source]