jsoup
Developer(s) | Jonathan Hedley |
---|---|
Stable release | 1.14.3 / September 30, 2021[1] |
Repository | |
Written in | Java |
Operating system | Cross-platform |
Platform | Java (JVM) |
Type | HTML parser |
License | MIT license |
Website | jsoup |
jsoup is an open-source Java library designed to parse, extract, and manipulate data stored in HTML documents.
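As an illustration of the parse, extract, and manipulate workflow, the following minimal Java sketch assumes the jsoup library is available on the classpath. The class name `JsoupSketch`, its helper methods, and the sample markup are hypothetical and chosen only for this example; `Jsoup.parse`, `select`, `text`, and `attr` belong to jsoup's public API.

```java
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;

public class JsoupSketch {

    // Parse an HTML string into a DOM, then extract the first anchor
    // that carries an href attribute by means of a CSS selector.
    static Element firstLink(String html) {
        Document doc = Jsoup.parse(html);
        return doc.select("a[href]").first(); // null when no link is present
    }

    // Extract the visible text of the first link, or null if there is none.
    static String firstLinkText(String html) {
        Element link = firstLink(html);
        return link == null ? null : link.text();
    }

    // Manipulate the parsed tree: set rel="nofollow" on the first link
    // and return the attribute's new value (null if there is no link).
    static String markNofollow(String html) {
        Element link = firstLink(html);
        if (link == null) {
            return null;
        }
        link.attr("rel", "nofollow");
        return link.attr("rel");
    }

    public static void main(String[] args) {
        // Sample markup (made up for this sketch); the <p> is deliberately left unclosed.
        String html = "<p>Visit <a href=\"https://example.com/\">Example</a> for details";
        System.out.println(firstLinkText(html));
        System.out.println(firstLink(html).attr("href"));
        System.out.println(markNofollow(html));
    }
}
```

The sample input omits the closing `</p>` tag to show that the parser still builds a usable document tree from imperfect markup.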
History
jsoup was created in 2009 by Jonathan Hedley. It is distributed under the MIT License, a permissive free software license similar to the Creative Commons Attribution license.
Hedley's stated intention in writing jsoup was "to deal with all varieties of HTML found in the wild; from pristine and validating, to invalid tag-soup."
Projects powered by jsoup
jsoup is used in a number of current projects,[2] including the data-wrangling tool OpenRefine (formerly Google Refine).
See also
- Comparison of HTML parsers
- Web scraping
- Data wrangling
- MIT License
References
- [1] "jsoup Java HTML Parser release 1.14.3". Retrieved 30 September 2021.
- [2] "Jsoup". MVNRepository / F. Rodriguez. 2015-03-08.
Categories:
- Java (programming language) libraries
- Free software programmed in Java (programming language)
- XML parsers
- HTML parsers
- Web scraping