BeautifulSoup & Selenium Book Automation

This Python project scrapes book data from Books to Scrape and automatically submits it into a Google Form. It also includes optional data analytics for visualizing the scraped data.

Features

- Scrapes book information from multiple pages:
  - Title
  - URL
  - Price
  - Rating
  - Stock availability
- Automatically submits the data to a Google Form
- Optional analytics using Matplotlib to visualize average price by rating
- Users can use their own Google Form with 5 text input fields:
  1. Book Title
  2. Book URL
  3. Book Price
  4. Book Rating
  5. Book Stock
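
The fields listed above can be pulled out of a Books to Scrape listing page roughly as follows. This is a minimal sketch, not the project's actual code: the function name `parse_books`, the `RATING_WORDS` mapping, and the dictionary keys are illustrative assumptions, and the CSS classes reflect the site's markup at the time of writing.

```python
from urllib.parse import urljoin

from bs4 import BeautifulSoup

# The site encodes the rating as a word in the CSS class, e.g. "star-rating Three".
RATING_WORDS = {"One": 1, "Two": 2, "Three": 3, "Four": 4, "Five": 5}


def parse_books(html, page_url):
    """Return one dict per book found in a listing page (hypothetical helper)."""
    soup = BeautifulSoup(html, "html.parser")
    books = []
    for card in soup.select("article.product_pod"):
        link = card.select_one("h3 a")
        # "class" is a multi-valued attribute, so this is a list of class names.
        rating_classes = card.select_one("p.star-rating")["class"]
        rating_word = next(c for c in rating_classes if c in RATING_WORDS)
        books.append({
            "title": link["title"],
            # Links on the page are relative, so resolve them against the page URL.
            "url": urljoin(page_url, link["href"]),
            "price": card.select_one("p.price_color").get_text(strip=True),
            "rating": RATING_WORDS[rating_word],
            "stock": card.select_one("p.availability").get_text(strip=True),
        })
    return books


# Trimmed-down sample of one book card, used here instead of a live request.
SAMPLE_HTML = """
<article class="product_pod">
  <p class="star-rating Three"></p>
  <h3><a href="a-light-in-the-attic_1000/index.html" title="A Light in the Attic">A Light in ...</a></h3>
  <p class="price_color">£51.77</p>
  <p class="instock availability">In stock</p>
</article>
"""

books = parse_books(SAMPLE_HTML, "https://books.toscrape.com/catalogue/page-1.html")
print(books[0])
```

Keeping the keys in the same order as the five form fields makes it easy to feed each book straight into the submission step.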

Getting Started

1. Clone the repository

git clone https://github.com/MPALONDON/BeautifulSoup-Selenium-Automation.git

2. Navigate to your project folder

cd BeautifulSoup-Selenium-Automation

3. Install dependencies

It’s recommended to use a virtual environment:

python -m venv .venv
.venv\Scripts\activate   # Windows
# source .venv/bin/activate   # Mac/Linux
pip install -r Requirements.txt

4. Set up your Google Form (Optional)

If you want to use your own Google Form:

Create a Google Form with 5 text inputs in this order:
- Book Title
- Book URL
- Book Price
- Book Rating
- Book Stock
Copy the form URL.
Create a .env file in the project root with:

GOOGLE_FORM_URL="YOUR_GOOGLE_FORM_URL_HERE"

If .env is not provided, the program will use the default form URL.
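
The fallback behavior can be sketched with the standard library alone. This is an illustration, not the project's code: `DEFAULT_FORM_URL` is a placeholder, `get_form_url` is a hypothetical helper, and it assumes the contents of .env have already been loaded into the process environment (for example by a package such as python-dotenv).

```python
import os

# Placeholder only; the project's real default form URL is not shown in this README.
DEFAULT_FORM_URL = "https://example.com/default-form"


def get_form_url(environ=None):
    """Return GOOGLE_FORM_URL if it is set and non-empty, else the default."""
    environ = os.environ if environ is None else environ
    # Strip whitespace and surrounding quotes in case the raw .env value is passed through.
    value = environ.get("GOOGLE_FORM_URL", "").strip().strip('"')
    return value or DEFAULT_FORM_URL


print(get_form_url({}))
print(get_form_url({"GOOGLE_FORM_URL": '"https://example.com/my-form"'}))
```

Passing the environment as a parameter keeps the helper easy to check without touching real environment variables.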

5. Run the program

python main.py

The program will ask how many pages to scrape.
It will scrape the data from the specified number of pages and submit it to your Google Form.
Optional analytics will be displayed automatically before submission.
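
The page-count prompt can be turned into a list of listing URLs along these lines. This is a sketch under assumptions: `page_urls` and `ask_page_count` are hypothetical names, and the URL pattern is the one Books to Scrape uses for its catalogue pages at the time of writing.

```python
BASE_URL = "https://books.toscrape.com/catalogue/page-{}.html"


def page_urls(page_count):
    """Build listing-page URLs for pages 1..page_count."""
    if page_count < 1:
        raise ValueError("page_count must be at least 1")
    return [BASE_URL.format(n) for n in range(1, page_count + 1)]


def ask_page_count(prompt="How many pages to scrape? "):
    """Keep asking until the user enters a positive whole number."""
    while True:
        answer = input(prompt).strip()
        if answer.isdigit() and int(answer) > 0:
            return int(answer)
        print("Please enter a positive whole number.")


# Non-interactive example; an interactive run would call page_urls(ask_page_count()).
for url in page_urls(3):
    print(url)
```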

Analytics

The program creates a simple bar chart showing average book price by rating using Matplotlib. This allows you to quickly visualize trends in the scraped data.
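
The calculation behind such a chart might look like the sketch below. The README does not show how plotting.py is written, so the function names, the `rating` and `price` keys, and the price format are assumptions made for illustration.

```python
from collections import defaultdict


def average_price_by_rating(books):
    """Map each rating to the mean price of the books with that rating."""
    prices = defaultdict(list)
    for book in books:
        # Prices are assumed to be scraped as text such as "£51.77".
        prices[book["rating"]].append(float(book["price"].lstrip("£")))
    return {rating: sum(vals) / len(vals) for rating, vals in sorted(prices.items())}


def plot_average_price(averages):
    """Draw the bar chart; Matplotlib is imported lazily so the math above runs without it."""
    import matplotlib.pyplot as plt

    plt.bar([str(rating) for rating in averages], list(averages.values()))
    plt.xlabel("Rating (stars)")
    plt.ylabel("Average price (£)")
    plt.title("Average book price by rating")
    plt.show()


sample = [
    {"rating": 3, "price": "£10.00"},
    {"rating": 3, "price": "£20.00"},
    {"rating": 5, "price": "£30.00"},
]
averages = average_price_by_rating(sample)
print(averages)  # {3: 15.0, 5: 30.0}
```

Calling `plot_average_price(averages)` would then open the chart window.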

Notes

The program currently waits for pages to load with fixed time.sleep() delays; Selenium's WebDriverWait is not implemented.
Make sure your Google Form is open and accepting responses.
The scraper only works with the page structure of Books to Scrape.
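
To illustrate the difference between a fixed sleep and an explicit wait, here is a plain-Python polling helper. It is not Selenium's WebDriverWait and not part of this project; `wait_until` and `page_ready` are hypothetical names used only to show the idea of returning as soon as a condition holds.

```python
import time


def wait_until(condition, timeout=10.0, poll_interval=0.5):
    """Call condition() until it returns a truthy value or timeout seconds pass."""
    deadline = time.monotonic() + timeout
    while True:
        result = condition()
        if result:
            return result
        if time.monotonic() >= deadline:
            raise TimeoutError(f"condition not met within {timeout} seconds")
        time.sleep(poll_interval)


# Stand-in for a page that becomes ready on the third check.
checks = {"count": 0}


def page_ready():
    checks["count"] += 1
    return checks["count"] >= 3


wait_until(page_ready, timeout=2.0, poll_interval=0.01)
print(checks["count"])  # 3
```

With Selenium, the condition could be something like `lambda: driver.find_elements(By.CSS_SELECTOR, "input[type='text']")`, which returns an empty list until the form fields exist. Selenium's own `WebDriverWait(driver, timeout).until(...)` provides the same behavior out of the box.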
