I have used this method to retrieve a webpage into an org.jsoup.nodes.Document object: myDoc = Jsoup.connect(myURL).ignoreContentType(true).get(); How should I write this object to a HTML file? The …
Advertisement
Tag: document
Extract the first page content from docx file by XML parsing
I need to extract the first page content from the docx file and save it as a seperate document. I need everything from the first page( images, tables, text) to be saved as it is in new docx file. What i tried is : I looked into the xml of the unzipped docx file. Since word document is reflowable i
Advertisement