Community Learn Tools Library Leisure

English

Home > Java > javaTutorial > Java Example - Web Scraping

Java Example - Web Scraping

黄舟

Release： 2017-01-20 11:58:43

Original

1393 people have browsed it

The following example demonstrates how to use the URL() constructor of the net.URL class to crawl a web page:

/*
 author by w3cschool.cc
 Main.java
 */import java.io.BufferedReader;import java.io.BufferedWriter;import java.io.FileWriter;import java.io.InputStreamReader;import java.net.URL;public class Main {
   public static void main(String[] args) 
   throws Exception {
      URL url = new URL("http://www.w3cschool.cc");
      BufferedReader reader = new BufferedReader
      (new InputStreamReader(url.openStream()));
      BufferedWriter writer = new BufferedWriter
      (new FileWriter("data.html"));
      String line;
      while ((line = reader.readLine()) != null) {
         System.out.println(line);
         writer.write(line);
         writer.newLine();
      }
      reader.close();
      writer.close();
   }}

Copy after login

The output result of running the above code is (the source code of the web page, stored in the current directory data.html file under):

<!DOCTYPE html> <html> <head> <meta charset="UTF-8"/> 
<meta http-equiv="X-UA-Compatible" content="IE=11,IE=10,IE=9,IE=8"/>……

Copy after login

The above is the Java example-web page crawling content. For more related content, please pay attention to the PHP Chinese website (m.sbmmt.com)!

Related labels：

Java ，网页抓取

source：php.cn

Previous article：Java Example - Connect to the specified host using Socket Next article：A small example of generating and parsing QR code images using ZXing in Java

Statement of this Website

The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Latest Articles by Author

Video material on building your own PHP framework from scratch

2023-03-15 16:54:01
Example analysis of how PHPMailer uses QQ mailbox to complete the email sending function

2023-03-15 12:26:02
Introduction to how to receive emails in IMAP in php

2023-03-14 18:58:01
Example of how to quickly implement array deduplication in PHP

2023-03-14 11:30:01
Summary of the use of all attributes of the tag in html

1970-01-01 08:00:00
Summary of basic knowledge of PHP (necessary for beginners to get started)

2023-03-16 15:20:01
Introduction to the use of typeof in JavaScript

1970-01-01 08:00:00
Introduction to the use of confirm() method in JavaScript

1970-01-01 08:00:00
A detailed introduction to the HTML5 Placeholder attribute

1970-01-01 08:00:00
How to implement single-select, multiple-select and reverse-select in forms in ReactJS

1970-01-01 08:00:00

Latest Issues

How do I get my image to appear on the page's main display? What I want to do is receive some photos using NASAAPI. These photos are then displayed on...

From 2024-04-06 15:33:12

0

1

433

return(); doesn't work for 1 route but works for almost the same route I have 2 routes, one for unsubscribing and one for restoring, both routes are the same exc...

From 2024-04-04 17:34:09

0

1

311

Scrapy: Guide to saving to CSV with custom column settings So basically I'm scraping data from the web and I have a project file imported into my mai...

From 2024-04-04 14:01:17

0

1

301

Web scraping: Missing href attribute - Need to simulate mouse clicks for web scraping? For a fun web scraping project, I want to collect NHL data from ttps://www.nhl.com/stats/t...

From 2024-04-04 10:32:06

0

1

3473

How to search title and use another column to check uniqueness I have some scraped product data in the database and I want to use it on my website. I wan...

From 2024-04-02 21:49:55

0

1

375

Related Topics

More>

Popular Recommendations

Popular Tutorials

More>

Related Tutorials

Popular Recommendations

Latest courses

Latest Downloads

More>

Web Effects

Website Source Code

Website Materials

Front End Template