II.9 Design Decisions
Note
You may like to understand some of the important, huge, expensive, risky, or otherwise special architecture and design decisions.
It's especially interesting to understand the reasons for these decisions.
9.1 Checking of External Links Postponed
In the current version of HtmlSC, we can't check external links. These checks have been postponed for later versions.
9.2 HTML Parsing with jsoup
To check HTML, we parse it into an internal (DOM-like) representation. For this task, we use jsoup (https://jsoup.org), an open source parser without external dependencies.
To quote from their website:
"jsoup is a Java library for working with real-world HTML. It provides a very convenient API for extracting and manipulating data, using the best of DOM, CSS, and jQuery-like methods."
Goals of this decision: Check HTML programmatically by using an existing API that provides access and finder methods to the DOM tree of the file(s) to be checked.
Decision criteria would...