Built a web application that scrapes various websites for data related to the Mission to Mars and displays the information in a single HTML page.

Step 1 - Scraping

Initial scraping was done using Jupyter Notebook, BeautifulSoup, Pandas, and Requests/Splinter.

Created a Jupyter Notebook file called mission_to_mars.ipynb and used it to complete all scraping and analysis tasks.

The following outlines what was scraped:

NASA Mars News

The first scrape is from the NASA Mars site and consists of both the latest News Title and Paragraph Text.
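
A minimal sketch of this step, assuming the page HTML has already been fetched with Requests or Splinter. The sample markup, the class names `content_title` and `article_teaser_body`, and the helper name `latest_news` are assumptions for illustration and are not taken from the notebook.

```python
from bs4 import BeautifulSoup

# Stand-in for the fetched page source (hypothetical markup).
SAMPLE_HTML = """
<ul>
  <li class="slide">
    <div class="content_title"><a href="/news/1">Example headline</a></div>
    <div class="article_teaser_body">Example teaser paragraph.</div>
  </li>
  <li class="slide">
    <div class="content_title"><a href="/news/2">Older headline</a></div>
    <div class="article_teaser_body">Older teaser paragraph.</div>
  </li>
</ul>
"""


def latest_news(html):
    """Return (title, paragraph) for the first article found in the HTML."""
    soup = BeautifulSoup(html, "html.parser")
    # Assumes the newest article appears first in the markup.
    title = soup.find("div", class_="content_title").get_text(strip=True)
    paragraph = soup.find("div", class_="article_teaser_body").get_text(strip=True)
    return title, paragraph


news_title, news_p = latest_news(SAMPLE_HTML)
```

Because `find` returns only the first match, the sketch depends on the site listing the newest article first.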

JPL Mars Space Images - Featured Image

The second scrape is of the JPL Featured Space Image. Splinter was used to navigate the site and find the image URL for the current Featured Mars Image. Once found, the image URL string was assigned to a variable called featured_image_url.
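
The last part of this step, turning the image's `src` into a full URL, might look like the sketch below. In the notebook, Splinter's `browser.html` would supply the page source. Here a sample string stands in for it, and the base URL, the markup, and the `figure.lede img` selector are all hypothetical.

```python
from urllib.parse import urljoin

from bs4 import BeautifulSoup

# Hypothetical page URL and markup standing in for Splinter's browser.html.
BASE_URL = "https://example.com/spaceimages/"
SAMPLE_HTML = (
    '<figure class="lede">'
    '<a href="/images/featured_mars.jpg"><img src="/images/featured_mars.jpg"></a>'
    "</figure>"
)


def featured_image(html, base_url):
    """Return the absolute URL of the featured image in the HTML."""
    soup = BeautifulSoup(html, "html.parser")
    img = soup.select_one("figure.lede img")  # selector is an assumption
    # urljoin resolves a relative src against the page URL.
    return urljoin(base_url, img["src"])


featured_image_url = featured_image(SAMPLE_HTML, BASE_URL)
```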

Mars Facts

The third scrape is from the Mars Facts webpage and is a table of relevant facts. Pandas was used to scrape the table containing facts about the planet, and the data was then converted to an HTML table string.
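
The conversion to an HTML table string can be sketched as follows. The small hand-built DataFrame stands in for the table that Pandas would read from the page (for example with `pd.read_html`). The column names, the two sample rows, and the variable name `mars_facts_html` are illustrative assumptions.

```python
import pandas as pd

# Stand-in for the scraped facts table; the rows are illustrative only.
facts_df = pd.DataFrame(
    [["Diameter", "6,779 km"], ["Moons", "2 (Phobos and Deimos)"]],
    columns=["Description", "Value"],
)

# Convert to an HTML table string; index=False drops the row numbers.
mars_facts_html = facts_df.to_html(index=False, classes="table")
```

The `classes` argument adds CSS class names to the `<table>` tag, which makes the string easier to style once it is placed in a template.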

Mars Hemispheres

The fourth scrape is from the USGS Astrogeology site and consists of high-resolution images for each of Mars's hemispheres. The data was stored in a Python list containing one dictionary per hemisphere. A for loop was used to build each dictionary, holding the hemisphere title and image URL string, and append it to the list.
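
The resulting data structure can be sketched as below. The dictionary keys `title` and `img_url`, the list name `hemisphere_image_urls`, and the example.com URL pattern are assumptions. In the real scrape, each URL would come from visiting that hemisphere's page.

```python
# Hemisphere names; the URL pattern below is a hypothetical placeholder.
hemisphere_names = ["Cerberus", "Schiaparelli", "Syrtis Major", "Valles Marineris"]

hemisphere_image_urls = []
for name in hemisphere_names:
    slug = name.lower().replace(" ", "_")
    # One dictionary per hemisphere, appended to the list.
    hemisphere_image_urls.append(
        {
            "title": f"{name} Hemisphere",
            "img_url": f"https://example.com/{slug}.jpg",
        }
    )
```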

Step 2 - MongoDB and Flask Application

MongoDB with Flask templating was used to create a new HTML page that displays all of the scraped information collected above.

The initial Jupyter Notebook was converted into a Python script called scrape_mars.py with a function called scrape that executes the code above and returns one Python dictionary containing all of the scraped data.
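
One possible shape for such a function is sketched here. The helper functions and their placeholder return values are hypothetical stand-ins for the scraping steps. Only the name `scrape` and the single returned dictionary come from the description above.

```python
def scrape_news():
    # Placeholder for the NASA Mars News step.
    return {"news_title": "Example headline", "news_p": "Example teaser."}


def scrape_featured_image():
    # Placeholder for the JPL featured image step.
    return {"featured_image_url": "https://example.com/featured.jpg"}


def scrape():
    """Run each step and merge the results into one dictionary."""
    mars_data = {}
    for step in (scrape_news, scrape_featured_image):
        mars_data.update(step())
    return mars_data
```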

Next, a Flask route was created that imports scrape_mars.py and calls the scrape function. The returned Mission to Mars data was stored in Mongo as a Python dictionary.

Lastly, an HTML template was created to display the data passed in from the Mongo database.
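
A hedged sketch of how the route, the Mongo storage, and the template could fit together. The route path `/scrape`, the database and collection names, the template fields, and the `create_app` helper are assumptions rather than the contents of app.py. An inline template string stands in for the HTML template file so the example stays self-contained.

```python
from flask import Flask, redirect, render_template_string

# Inline stand-in for the HTML template file (hypothetical field names).
PAGE = """
<h1>Mission to Mars</h1>
<h2>{{ mars.news_title }}</h2>
<p>{{ mars.news_p }}</p>
<img src="{{ mars.featured_image_url }}" alt="Featured Mars image">
"""


def create_app(collection, scrape_fn):
    """Build the app around a Mongo-style collection and a scrape function."""
    app = Flask(__name__)

    @app.route("/")
    def index():
        # Read the single stored document; fall back to an empty dict.
        mars = collection.find_one() or {}
        return render_template_string(PAGE, mars=mars)

    @app.route("/scrape")
    def run_scrape():
        # Overwrite the stored fields with fresh data; upsert creates the
        # document on the first run.
        collection.update_one({}, {"$set": scrape_fn()}, upsert=True)
        return redirect("/")

    return app


def main():
    # Wiring for a real run; assumes a local MongoDB and scrape_mars.py.
    from pymongo import MongoClient
    from scrape_mars import scrape

    client = MongoClient("mongodb://localhost:27017")
    create_app(client.mars_db.mars, scrape).run(debug=True)
```

Passing the collection and scrape function into `create_app` keeps the routes independent of a live database. Calling `main()` starts the server against a local MongoDB instance.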

CMDenys/web-scraping-to-mars

Folders and files

Latest commit

History

Repository files navigation

Step 1 - Scraping

The following outlines what was scraped:

NASA Mars News

JPL Mars Space Images - Featured Image

Mars Facts

Mars Hemispheres

Step 2 - MongoDB and Flask Application

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages