This is an interactive data visualization of the trajectory of high school recruits and their nba draft position. Check out the demo here.
In this repo you will find the raw JSON data for the visualization, the scraper used to gather this data, as well as the frontend for the chart.
- Make
- Npm
- Node.js v4+
- Gulp
- Python3
- Pip3/Requests
- Pip3/BeautifulSoup4
Run the following to compile and run the visualization, access it on http://localhost:3000
$ git clone https://github.com/jaxgeller/high-school-recruits-analytics
$ cd high-school-recruits-analytics
$ cd www && npm install
$ cd ../ && make build
Data is included in this repo, but if you'd like to build from scratch run
$ make raw
This may take a while. It gathers data from different sources, 247 Sports (for HS rankings), Wikipedia (for draft data), and ESPN NBA (for stats and pictures).
After this is done, you'll need to transform the raw data into separate years for use with the sankey plot. Run
$ make format
This yields individual year datasets, as well as nodes and links for the sankey plot.
The data can be used and downloaded from the data directory in this repo. Raw data is encoded as JSON and contains all the meta data about players. If there is high enough demand, I can add a csv of the raw player data for use in other plots. Track this issue if interested.
All contributions are welcome. If there are issues with the site, data, or tools asociated with it, please open up an issue or submit a pull request.
I'd like to thank Daniel Christian for the write up on the data. He did an excellent job writing an anecdotal analysis focussing on some of the busts and booms.
I'd also like to thank Chris Allen for the amazing and beautiful design of the site.