GitHub - seyedmahdiamin1998/catawiki: Project to crawl data from the catawiki website.

Catawiki Crawler

Catawiki Crawler is a project to crawl data from the Catawiki website. In this project, I focus on extracting car ad data.

Catawiki is the most-visited curated marketplace in Europe for special objects, offering over 65,000 objects for auction each week. Their mission is to provide an exciting and seamless experience to their customers for buying and selling special, hard-to-find objects. website: [https://www.catawiki.com/en/]

Intallation

$ pip install -r requirements.txt

or if you use pipenv for managing virtual environments you can either install dependencies by code below.

$ pipenv install

Initiallization

First of all you need to go to file pipelines.py and customize information of your Postgresql database.

self.USER = 'postgres'
self.PASSWORD = '1234'

Or if you want to connect to other databases like SQLite, MySQL , oracle, Microsoft SQL Server, or other databases which sqlalchemy support them, change the value of SQLALCHEMY_DATABASE_URL.

Now you are ready to run the spider which name is car.

$ cd catawiki
$ scrapy crawl car

Save results as json/csv/xml

If you want to save results as a json, csv or xml file, use below codes.

$ scrapy crawl car -o results.json

$ scrapy crawl car -o results.csv

$ scrapy crawl car -o results.xml

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
assets/img		assets/img
catawiki		catawiki
Pipfile		Pipfile
Pipfile.lock		Pipfile.lock
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Catawiki Crawler

Intallation

Initiallization

Save results as json/csv/xml

result example

Table Classic Cars

Table Sellers

License

About

Releases

Packages

Languages

seyedmahdiamin1998/catawiki

Folders and files

Latest commit

History

Repository files navigation

Catawiki Crawler

Intallation

Initiallization

Save results as json/csv/xml

result example

Table Classic Cars

Table Sellers

License

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages