Data analysis examples for the eMOTIONAL Cities project. Data used in the notebooks has been made publicly available in Amazon Simple Storage Service (S3) buckets.
More information about sample data sharing can be found in the eMOTIONAL Cities data-share repository.
Visual Studio Code: All notebooks have been tested in Visual Studio Code on a Windows platform. Testing on other platforms and environments is forthcoming, and the results will be added here.
- Open the project folder in VS Code
- Install Miniconda (Python 3.9)
- Install the Python extension for VS Code
- Create an environment from VS Code: press Ctrl+Shift+P and run >Create Environment
- Select the .conda environment
- Make sure the correct environment is selected in the notebook
The current notebook requires Python 3.9+ to run successfully. The file environment.yml contains the list of minimal package dependencies required.
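As a quick sanity check before opening a notebook, the interpreter version can be compared against the 3.9 minimum stated above. This is only a minimal sketch; the helper name `python_is_supported` is an illustrative assumption and is not part of the repository.

```python
import sys

# Minimum version stated in this README.
MIN_VERSION = (3, 9)


def python_is_supported(version_info=sys.version_info, minimum=MIN_VERSION):
    """Return True if the (major, minor) version meets the minimum."""
    return tuple(version_info[:2]) >= minimum


if __name__ == "__main__":
    found = sys.version.split()[0]
    if python_is_supported():
        print("Python version OK:", found)
    else:
        print("Python 3.9+ is required, found:", found)
```

Running this in the first notebook cell, or in the activated terminal, confirms that the selected environment is the one intended.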
Before trying to export datasets, make sure you can run the notebooks on an example dataset to confirm that all dependencies are installed and the required environment configuration is valid.
- Open a Python-activated command line in the folder src/ingestion.
- Run the export.py module:
python -m export <DATA_ROOT_PATH> --contacts contacts.json
The contacts.json file provides metadata about institutional contacts that should be attached to the data export. An example file is provided below:
{
  "contacts": [
    {
      "name": "FirstName LastName",
      "institution": "Contoso",
      "email": "name1@example.com"
    },
    {
      "name": "AnotherName AnotherLast",
      "institution": "Contoso",
      "email": "name2@example.com"
    }
  ]
}
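Before running an export, it can help to check that a contacts file has the same shape as the example above. The following is a minimal sketch, assuming that the three keys shown in the example (name, institution, email) are all required; the export module may enforce different rules, and `validate_contacts` is a hypothetical helper, not part of the repository.

```python
import json

# Keys seen in the example file; treating them as required is an assumption.
REQUIRED_KEYS = ("name", "institution", "email")


def validate_contacts(document):
    """Return a list of problems found in a parsed contacts document."""
    contacts = document.get("contacts") if isinstance(document, dict) else None
    if not isinstance(contacts, list) or not contacts:
        return ["top-level 'contacts' must be a non-empty list"]
    problems = []
    for index, entry in enumerate(contacts):
        if not isinstance(entry, dict):
            problems.append(f"contact {index}: expected an object")
            continue
        for key in REQUIRED_KEYS:
            value = entry.get(key)
            if not isinstance(value, str) or not value.strip():
                problems.append(f"contact {index}: missing or empty '{key}'")
    return problems


example = json.loads("""
{
  "contacts": [
    {"name": "FirstName LastName", "institution": "Contoso", "email": "name1@example.com"}
  ]
}
""")
# An empty list means no problems were found.
print(validate_contacts(example))
```

To check a file on disk, load it with `json.load(open("contacts.json", encoding="utf-8"))` and pass the result to the same function.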
Note: If your dataset is missing UBX synchronization signals, you can provide fallback schemas to be used in case of failure, e.g.
python -m export <DATA_ROOT_PATH> --contacts contacts.json --schema outdoor missing_sync
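The command above appears to pass more than one value to --schema. The sketch below shows how a command line of that shape could be parsed with argparse. It is a hypothetical illustration only: the parser, the argument names `data_root` and `schema`, and the sample path `data/session01` are assumptions, and the real export.py may define its arguments differently.

```python
import argparse


def build_parser():
    # Hypothetical parser mirroring the command-line shape shown above;
    # it is not the parser used by the actual export module.
    parser = argparse.ArgumentParser(prog="export")
    parser.add_argument("data_root", help="root folder of the dataset")
    parser.add_argument("--contacts", help="path to the contacts JSON file")
    # nargs="+" collects one or more schema names into a list, in order.
    parser.add_argument("--schema", nargs="+", default=[], help="schema names")
    return parser


args = build_parser().parse_args(
    ["data/session01", "--contacts", "contacts.json",
     "--schema", "outdoor", "missing_sync"]
)
print(args.data_root, args.contacts, args.schema)
```

Under this assumed layout, the schema names arrive as an ordered list, so the order in which they are written on the command line is preserved.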