Skip to content

A simple script for importing the data from the TON_IoT dataset into your elasticsearch instance with a suggested index mapping in JSON format. This script currently only features importing the Processed_Network_dataset.

License

Notifications You must be signed in to change notification settings

entewurzelauskuh/toniot-to-elastic-tool

Repository files navigation

toniot-to-elastic-tool

A simple script for importing the Processed_Network_dataset from the TON_IoT dataset into your elasticsearch instance with a suggested index mapping in JSON format.

A note beforehand

Please note: This script will NOT create an elasticsearch index for you. You need to do this beforehand yourself. The suggested elasticsearch mapping used by this script can be found in the elasticsearch_toniot_index_mapping.json file.

How to use this script

Clone this project into a directory. You'll then need to download the TON_IoT dataset from the official source. However, only the Processed_Network_dataset part is supported, so it is enough to only download this part. The dataset comes in 23 separate CSV files (Network_dataset_1.csv - Network_dataset_23.csv), which need to be placed in the data folder of this project.

Next, I recommend using a virtualenv to create a separate python instance besides your default one. To create a virtualenv, you need to install the package to create one through sudo apt-get install virtualenv and create the environment with virtualenv <env_name>. You can then activate it with source <env_name>/bin/activate and install the required packages for this script with pip install -r requirements.txt.

Finally, you can run this script with python with python run.py -u <es_user> -p <es_password> -i <es_index> [-e <es_host> -p <es_port> -m <http_method> -l]. See python run.py -h for a detailed explanation of all accepted parameters.

About

A simple script for importing the data from the TON_IoT dataset into your elasticsearch instance with a suggested index mapping in JSON format. This script currently only features importing the Processed_Network_dataset.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages