Java Web Scraper project is returning null instead of normal links

Question

Used maven for htmlunit dependency for the webscraper. The main issue is that my scraper returns null instead of links. I made an item class to set and get. } Result: basically a line of null going down *note: Putting System.out.println(link) returns one link and reuses that same link as it prints new line, i…

Accepted Answer

This works herepublic static void main(String[] args) throws IOException {    String url = "https://sfbay.craigslist.org/search/sss?query=iphone%208&sort=rel";    try (final WebClient webClient = new WebClient()) {        HtmlPage page = webClient.getPage(url);        // webClient.waitForBackgroundJavaScript(10_000);        List<HtmlElement> items = page.getByXPath("//li[@class='result-row']");        for(HtmlElement htmlItem : items){             HtmlAnchor itemAnchor = ((HtmlAnchor)htmlItem.getFirstByXPath("a[@class='result-image gallery']"));             if (itemAnchor != null) {               String link = itemAnchor.getHrefAttribute();               System.out.println("-> " + link);             }        }    }}producing something like-> https://sfbay.craigslist.org/eby/pho/d/walnut-creek-original-new-defender/7470991009.html-> https://sfbay.craigslist.org/eby/pho/d/walnut-creek-original-new-defender/7471913572.html-> https://sfbay.craigslist.org/eby/pho/d/walnut-creek-original-new-defender/7471010388.html....

Advertisement

Answer