Distributed Crawler Management Framework Based on Scrapy, Scrapyd, Django and Vue.js
Gerapy is a distributed crawler management framework based on Scrapy, Scrapyd, Scrapyd-Client, Scrapyd-API, Django, and Vue.js. It allows users to create configurable projects, generate code for Scrapy, and manage the deployment and monitoring of jobs. Gerapy is developed using Python 3.x and can be installed via pip.
To install Gerapy, use the following command:
pip install gerapy
After installation, follow these steps to run the Gerapy server:
gerapy init
gerapy
folder:cd gerapy
gerapy migrate
gerapy createsuperuser
gerapy runserver
You can now access Gerapy by visiting http://localhost:8000. The admin management backend can be accessed at http://localhost:8000/admin.
If you want to run Gerapy in public, use the following command:
gerapy runserver 0.0.0.0:8000
Gerapy is a distributed crawler management framework that simplifies the process of creating and deploying web crawlers. It provides features such as configurable projects, automated code generation, and deployment management. With its support for Docker, Gerapy offers a convenient solution for setting up and running crawler tasks.