Jsoup can be used to manipulate or extract data from a file on local that contains HTML. filePath
is path of a file on disk. ENCODING
is desired Charset Name e.g. "Windows-31J". It is optional.
// load file
File inputFile = new File(filePath);
// parse file as HTML document
Document doc = Jsoup.parse(filePath, ENCODING);
// select element by <a>
Elements elements = doc.select("a");