Skip to content
This repository has been archived by the owner on Apr 2, 2019. It is now read-only.
/ gdr-ingest Public archive

Scripts / pipelines for slurping external datasets into Broad data repositories.

License

Notifications You must be signed in to change notification settings

broadinstitute/gdr-ingest

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

GDR Ingest

Custom logic for slurping external datasets into Broad data repositories.

Building the project

The project is built using sbt. Once the tool is installed, you can compile with:

$ cd ${PROJECT_ROOT}
$ sbt
# Much garbage logging, before dropping into the sbt repl
sbt:gdr-ingest> compile

You can also compile by running sbt compile from the project root in bash, but that will eat the build tool's startup cost on every call.

Running ENCODE ingest

From within the sbt repl, run:

sbt:gdr-ingest> encode-ingest/run --help

You can also run "help" for specific sub-commands to see their options, i.e.

sbt:gdr-ingest> encode-ingest/run download-metadata --help

Testing ENCODE ingest

There aren't any automated tests (yet). Manual testing has been done by comparing outputs to a test workspace in FireCloud containing a subset of all data.

Running the Explorer API

Running locally

First, make sure Postgres 9.6 is installed, running, and seeded with data:

$ brew install postgresql@9.6
$ echo 'export PATH="/usr/local/opt/postgresql@9.6/bin:$PATH"' >> ~/.bash_profile
$ source ~/.bash_profile
$ brew services start postgresql@9.6
$ psql postgres < encode/explorer/db/tables.sql
$ gsutil cat gs://broad-gdr-encode-transfer-tmp/snapshot.sql | psql postgres

Then, add configuration for accessing the DB to encode/explorer/src/main/resources/application.conf. The needed fields are:

  • org.broadinstitute.gdr.encode.explorer.db.username
  • org.broadinstitute.gdr.encode.explorer.db.password

Finally, from within the sbt repl run:

sbt:gdr-ingest> encode-explorer/run

If you see ASCII art, the server is successfully running.

You can then query the API by curl-ing localhost:

$ curl localhost:8080/api/facets

Running in GCP

Run encode/explorer/deploy.sh to push the API server into App Engine.

Configuring the Explorer API

Default configuration is listed in encode/explorer/src/main/resources/reference.conf. To tweak the defaults, you can either modify reference.conf or add overrides to application.conf.

About

Scripts / pipelines for slurping external datasets into Broad data repositories.

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •