Jsoup

{{lowercase|title=jsoup}}

{{Infobox software

| name = jsoup Java HTML Parser

| developer = Jonathan Hedley

| programming language = Java

| platform = Java (JVM)

| genre = HTML parser

| license = MIT license

| website = {{url|https://jsoup.org/}}

| latest_release_version = 1.21.1

| latest_release_date = {{Start date and age|2025|06|23}}{{cite web|url=https://jsoup.org/news/release-1.21.1|title=jsoup Java HTML Parser release 1.21.1|access-date=2025-06-23}}

| operating_system = Cross-platform

}}

jsoup is an open-source Java library designed to parse, extract, and manipulate data stored in HTML documents.
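
The parse, extract, and manipulate steps can be illustrated with a minimal sketch. The class name <code>JsoupSketch</code>, its helper methods, and the sample markup are hypothetical and chosen only for illustration; the sketch assumes the jsoup library is on the classpath and uses its <code>Jsoup.parse</code>, <code>select</code>, <code>selectFirst</code>, and <code>attr</code> calls.

```java
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;

// Hypothetical illustration class; not part of jsoup itself.
public class JsoupSketch {

    // Extract: parse the markup and return the href of the first link,
    // or an empty string when the document contains no link.
    static String firstLinkHref(String html) {
        Document doc = Jsoup.parse(html);
        Element link = doc.selectFirst("a[href]");
        return link == null ? "" : link.attr("href");
    }

    // Manipulate: add rel="nofollow" to every link, then return the
    // serialized body of the modified document.
    static String addNofollow(String html) {
        Document doc = Jsoup.parse(html);
        for (Element link : doc.select("a[href]")) {
            link.attr("rel", "nofollow");
        }
        return doc.body().html();
    }

    public static void main(String[] args) {
        // Sample input with unclosed <p> and <a> tags (assumed for illustration).
        String html = "<p>Read the <a href='https://example.com/'>documentation";
        System.out.println(firstLinkHref(html));
        System.out.println(addNofollow(html));
    }
}
```

In this sketch each helper parses its input independently, which keeps the example short; code that performs several operations on the same page would more likely parse once and reuse the resulting <code>Document</code>.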

History

jsoup was created in 2009 by Jonathan Hedley. It is distributed under the MIT License, a permissive free software license similar to the Creative Commons attribution license.

Hedley's avowed intention in writing jsoup was "to deal with all varieties of HTML found in the wild; from pristine and validating, to invalid tag-soup."

Projects powered by jsoup

jsoup is used in a number of current projects,{{Cite web |url=http://mvnrepository.com/artifact/org.jsoup/jsoup |title=Jsoup |publisher=MVNRepository / F. Rodriguez |date=2015-03-08}} including the OpenRefine data-wrangling tool (formerly Google Refine).


References

{{Reflist}}