The Amazon Scraper is a powerful tool designed to extract detailed product information from Amazon. It allows you to scrape product details like prices, reviews, and stock availability without the need for the Amazon API.
Created by Bitbash, built to showcase our approach to Scraping and Automation!
If you are looking for an Amazon Scraper, you've just found your team. Let's chat.
This project is a web scraper that helps users collect product data from Amazon, including prices, reviews, ratings, and availability. It simplifies the process of gathering product information for competitive analysis, market research, or personal use.
- Scrapes Amazon product pages without using the official API.
- Extracts comprehensive data, including reviews, prices, and availability.
- Simple to use: just input an Amazon URL and start scraping.
- Outputs data in a structured JSON format for easy integration with other tools.
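Since the scraper takes an Amazon URL as its input, it can help to validate that URL before starting a run. The README does not document how the tool parses URLs, so the following is only a minimal sketch: `extract_asin` is a hypothetical helper, and the regular expression assumes the common `/dp/<ASIN>` and `/gp/product/<ASIN>` path shapes with a 10-character ASIN.

```python
import re
from typing import Optional
from urllib.parse import urlparse

# Assumption: product URLs carry the ASIN after /dp/ or /gp/product/.
_ASIN_RE = re.compile(r"/(?:dp|gp/product)/([A-Z0-9]{10})(?:/|$)")


def extract_asin(url: str) -> Optional[str]:
    """Return the ASIN found in an Amazon product URL, or None if there is none."""
    path = urlparse(url).path  # drops the query string and fragment
    match = _ASIN_RE.search(path)
    return match.group(1) if match else None


if __name__ == "__main__":
    print(extract_asin("https://www.amazon.com/dp/B09X7MPX8L"))
    print(extract_asin("https://www.amazon.com/gp/help"))
```

A URL for which this returns `None` is probably not a product page and can be skipped before any request is made.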
| Feature | Description |
|---|---|
| Product Details Extraction | Scrapes detailed information like title, price, ASIN, etc. |
| Review & Rating Scraping | Collects reviews and ratings, including breakdowns by stars. |
| Stock and Availability Info | Includes stock status and expected delivery dates. |
| Customizable Scraping Limits | Allows you to specify the number of items to scrape per request. |
| Easy Setup | Quick start with minimal setup and configuration. |
| Field Name | Field Description |
|---|---|
| title | The title of the product. |
| url | The URL of the product on Amazon. |
| asin | Amazon Standard Identification Number, unique to each product. |
| inStock | Boolean indicating if the product is in stock. |
| price | Current price of the product, as an object with a numeric `value` and a `currency` symbol. |
| listPrice | The original price before discounts, if available. |
| stars | Average star rating of the product. |
| reviewsCount | Number of reviews the product has received. |
| answeredQuestions | The number of questions answered for the product. |
| breadCrumbs | Category path of the product, from top-level category down to subcategory, as a single string. |
| thumbnailImage | The URL to the product's thumbnail image. |
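For downstream code, the fields above can be modeled as typed dictionaries. This is a sketch based only on the field table and the sample output in this README, not on the project's source. The class names `Price` and `ProductRecord`, the helper `missing_fields`, and the assumption that `listPrice` has the same shape as `price` are all illustrative.

```python
from typing import List, Optional, TypedDict


class Price(TypedDict):
    value: float   # numeric amount, e.g. 145.5
    currency: str  # currency symbol as scraped, e.g. "$"


class ProductRecord(TypedDict, total=False):
    # total=False: any field may be absent from a scraped record.
    title: str
    url: str
    asin: str
    inStock: bool
    price: Price
    listPrice: Optional[Price]  # assumption: same shape as `price`
    stars: float
    reviewsCount: int
    answeredQuestions: int
    breadCrumbs: str
    thumbnailImage: str


def missing_fields(record: dict) -> List[str]:
    """List the documented field names that are absent from a record."""
    return [name for name in ProductRecord.__annotations__ if name not in record]


if __name__ == "__main__":
    print(missing_fields({"title": "Example", "asin": "B09X7MPX8L"}))
```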
[
  {
    "title": "SanDisk 1TB Extreme microSDXC UHS-I Memory Card with Adapter - Up to 190MB/s",
    "url": "https://www.amazon.com/dp/B09X7MPX8L",
    "asin": "B09X7MPX8L",
    "inStock": true,
    "price": {
      "value": 145.5,
      "currency": "$"
    },
    "stars": 4.8,
    "reviewsCount": 36704,
    "answeredQuestions": 151,
    "breadCrumbs": "Electronics › Computers & Accessories › Computer Accessories & Peripherals › Memory Cards › Micro SD Cards",
    "thumbnailImage": "https://m.media-amazon.com/images/I/716kSUlHouL.__AC_SX300_SY300_QL70_FMwebp_.jpg"
  }
]
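Because the output is plain JSON, it can be consumed with the standard library alone. The sketch below parses a trimmed copy of the sample record and prints a one-line summary. `summarize` is a hypothetical helper, and in practice the JSON would be read from wherever the scraper writes its output rather than from an inline string.

```python
import json

# Trimmed copy of the sample record shown above.
SAMPLE_OUTPUT = """
[
  {
    "title": "SanDisk 1TB Extreme microSDXC UHS-I Memory Card with Adapter - Up to 190MB/s",
    "asin": "B09X7MPX8L",
    "inStock": true,
    "price": {"value": 145.5, "currency": "$"},
    "stars": 4.8,
    "reviewsCount": 36704
  }
]
"""


def summarize(record: dict) -> str:
    """Build a short human-readable summary of one scraped product."""
    price = record.get("price") or {}
    if "value" in price:
        price_text = f"{price.get('currency', '')}{price['value']:.2f}"
    else:
        price_text = "n/a"  # price can be missing from a record
    stock = "in stock" if record.get("inStock") else "out of stock"
    return (
        f"{record['asin']}: {price_text}, {stock}, "
        f"{record.get('stars')} stars from {record.get('reviewsCount')} reviews"
    )


records = json.loads(SAMPLE_OUTPUT)
for record in records:
    print(summarize(record))
# prints: B09X7MPX8L: $145.50, in stock, 4.8 stars from 36704 reviews
```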
Amazon Scraper/
├── src/
│   ├── runner.py
│   ├── extractors/
│   │   └── amazon_parser.py
│   ├── outputs/
│   │   └── exporters.py
│   └── config/
│       └── settings.example.json
├── data/
│   ├── inputs.sample.txt
│   └── sample.json
├── requirements.txt
└── README.md
- E-commerce Researchers use it to monitor product performance, helping them benchmark their own products against competitors.
- Marketing Professionals scrape Amazon reviews and prices to refine their ad strategies and messaging.
- Data Analysts gather comprehensive product data for reporting, trend analysis, and business intelligence.
Q1: Can I scrape any Amazon page with this tool?
- Yes, this tool can scrape any publicly accessible product page on Amazon.
Q2: Does it support multiple Amazon countries?
- Currently it supports Amazon.com only; other regional sites can be supported by modifying the code.
Q3: What do I do if Amazon blocks my requests?
- Amazon may throttle or block requests when traffic is high. To reduce this, lower the scraping limits or route requests through rotating proxies.
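One common way to combine slower request rates with rotating proxies is retrying with exponential backoff. The following is a generic sketch, not this project's implementation: `fetch_with_backoff`, the `Throttled` exception, and the `fetch(url, proxy)` callable are all hypothetical names, and the caller is expected to supply a `fetch` that raises `Throttled` when a request is blocked.

```python
import random
import time
from typing import Callable, Optional, Sequence


class Throttled(Exception):
    """Raised by the caller's fetch function when a request is blocked."""


def fetch_with_backoff(
    fetch: Callable[[str, Optional[str]], str],
    url: str,
    proxies: Sequence[str] = (),
    max_attempts: int = 4,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Call fetch(url, proxy), rotating proxies and backing off on Throttled."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    last_error: Optional[Throttled] = None
    for attempt in range(max_attempts):
        # Rotate through the proxy list; None means a direct connection.
        proxy = proxies[attempt % len(proxies)] if proxies else None
        try:
            return fetch(url, proxy)
        except Throttled as exc:
            last_error = exc
            if attempt < max_attempts - 1:
                # Delay doubles each attempt, plus jitter to avoid bursts.
                delay = base_delay * (2 ** attempt) + random.uniform(0, base_delay)
                sleep(delay)
    assert last_error is not None
    raise last_error
```

The `sleep` parameter is injectable so the retry logic can be tested without actually waiting.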
Primary Metric: Average scrape speed of 100 products per minute, depending on page complexity.
Reliability Metric: 98% successful extraction rate, with occasional Amazon throttling.
Efficiency Metric: Low memory and CPU usage during scraping.
Quality Metric: 95% data completeness for title, price, and reviews; occasional missing description.
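To check data completeness on your own runs, one possible definition is the share of non-empty values across the fields of interest. The README does not state how the 95% figure was measured, so the `completeness` function below and its choice of fields (`title`, `price`, `reviewsCount`) are assumptions for illustration.

```python
from typing import Iterable, Sequence


def completeness(
    records: Sequence[dict],
    fields: Iterable[str] = ("title", "price", "reviewsCount"),
) -> float:
    """Fraction of (record, field) pairs that hold a non-empty value."""
    field_list = list(fields)
    if not records or not field_list:
        return 0.0
    filled = sum(
        1
        for record in records
        for name in field_list
        if record.get(name) not in (None, "")  # 0 still counts as filled
    )
    return filled / (len(records) * len(field_list))


if __name__ == "__main__":
    sample = [
        {"title": "A", "price": {"value": 10.0, "currency": "$"}, "reviewsCount": 12},
        {"title": "B", "price": None, "reviewsCount": 0},
    ]
    print(f"{completeness(sample):.1%}")  # 5 of 6 values present
```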
