A Java web scraper using Jsoup to extract headers, links, and other elements from web pages. Includes retry logic for failed connections and file-saving functionality.

JFCiscoHuerta/java-web-scraper

JavaWebScraper

WebScrapper is a Java tool that scrapes data from web pages using Jsoup. It extracts headings (h1, h2, h3, h4), links, and any other elements matched by a CSS selector.
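As a rough illustration of selector-based extraction with Jsoup, the sketch below parses an HTML string and collects headings, links, and arbitrary selector matches. The class `SelectorSketch` and its method names are hypothetical and are not taken from this repository; only the Jsoup calls (`Jsoup.parse`, `select`, `text`, `tagName`, `absUrl`) are real library API.

```java
import java.util.ArrayList;
import java.util.List;

import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;

// Hypothetical helper for illustration; not the repository's actual class.
final class SelectorSketch {

    // Returns "tag: text" for every h1-h4 heading, in document order.
    static List<String> extractHeadings(String html) {
        Document doc = Jsoup.parse(html);
        List<String> headings = new ArrayList<>();
        for (Element heading : doc.select("h1, h2, h3, h4")) {
            headings.add(heading.tagName() + ": " + heading.text());
        }
        return headings;
    }

    // Returns the absolute URL of every link, resolved against baseUri.
    static List<String> extractLinks(String html, String baseUri) {
        Document doc = Jsoup.parse(html, baseUri);
        List<String> links = new ArrayList<>();
        for (Element link : doc.select("a[href]")) {
            links.add(link.absUrl("href"));
        }
        return links;
    }

    // Returns the text of every element matched by an arbitrary CSS selector.
    static List<String> extractBySelector(String html, String cssQuery) {
        Document doc = Jsoup.parse(html);
        List<String> matches = new ArrayList<>();
        for (Element element : doc.select(cssQuery)) {
            matches.add(element.text());
        }
        return matches;
    }

    public static void main(String[] args) {
        String html = "<h1>Title</h1><p class='intro'>Hello</p>"
                + "<h2>Section</h2><a href='/docs'>Docs</a>";
        System.out.println(extractHeadings(html));
        System.out.println(extractLinks(html, "https://example.com/"));
        System.out.println(extractBySelector(html, "p.intro"));
    }
}
```

Parsing from a string keeps the sketch independent of the network; the same `select` calls apply to a `Document` fetched from a live URL.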

The tool retries failed connections and can save the scraped data to a file. To configure the scraper, provide a user agent and the target URL you want to gather data from.
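One way retry logic and file saving could look is sketched below, using only the Java standard library. The class `RetrySketch`, the method names, and the `maxAttempts` and `delayMillis` parameters are assumptions made for illustration; the repository's own implementation may differ. With Jsoup, the fetch passed in would typically be something like `() -> Jsoup.connect(url).userAgent(userAgent).get()`.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;
import java.util.concurrent.Callable;

// Hypothetical helper for illustration; not the repository's actual class.
final class RetrySketch {

    // Runs the action, retrying on IOException up to maxAttempts times in total.
    // Waits delayMillis between attempts and rethrows the last failure if all fail.
    static <T> T withRetries(Callable<T> action, int maxAttempts, long delayMillis)
            throws Exception {
        if (maxAttempts < 1) {
            throw new IllegalArgumentException("maxAttempts must be at least 1");
        }
        IOException lastFailure = null;
        for (int attempt = 1; attempt <= maxAttempts; attempt++) {
            try {
                return action.call();
            } catch (IOException e) {
                lastFailure = e;
                if (attempt < maxAttempts) {
                    Thread.sleep(delayMillis);
                }
            }
        }
        throw lastFailure;
    }

    // Writes one scraped item per line to the target file as UTF-8,
    // replacing any existing content.
    static void saveLines(List<String> lines, Path target) throws IOException {
        Files.write(target, lines, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) throws Exception {
        int[] calls = {0};
        // Simulated fetch that fails twice before succeeding.
        String page = withRetries(() -> {
            if (++calls[0] < 3) {
                throw new IOException("simulated connection failure");
            }
            return "h1: Title";
        }, 5, 100L);
        Path out = Files.createTempFile("scraped", ".txt");
        saveLines(List.of(page), out);
        System.out.println("Saved after " + calls[0] + " attempts to " + out);
    }
}
```

Only `IOException` is retried here, on the assumption that connection problems surface that way; other exceptions propagate immediately so that programming errors are not masked.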

License

This project is licensed under the Apache 2.0 license. See the LICENSE file for details.
