w:en:HTML Tidy

{{short description|Application for correcting invalid HTML}}

{{Refimprove|date=August 2012}}

{{Infobox software

| logo =

| screenshot =

| caption =

| collapsible =

| author = Dave Raggett

| developer = HTML Tidy Advocacy Community Group

| released =

| discontinued =

| latest release version = {{wikidata|property|reference|P348}}

| latest release date = {{start date and age|{{wikidata|qualifier|P348|P577}}}}

| programming language = C

| operating system = {{#statements:operating system}}

| platform =

| size =

| language =

| genre = Library, Console Application

| license = {{#statements:copyright license}}

}}

HTML Tidy is a console application for correcting invalid HyperText Markup Language (HTML), detecting potential web accessibility errors, and for improving the layout and indent style of the resulting markup. It is also a cross-platform library for computer applications that provides HTML Tidy's features.

History

HTML Tidy was developed by Dave Raggett{{cite web | url=https://www.w3.org/People/Raggett/tidy/ | title=Clean up your Web pages with HTML TIDY | first=Dave | last=Raggett | work=W3C.org | access-date=2015-02-12}} (Dave Raggett's legacy HTML Tidy page.) of the World Wide Web Consortium (W3C). Later it was released as a SourceForge project in 2003 and managed by various maintainers.{{cite web|url=http://tidy.cvs.sourceforge.net/tidy/ |title=SourceForge.net Repository - [tidy] Index of / |publisher=Tidy.cvs.sourceforge.net |access-date=2015-04-25}}

In 2012, the project was moved to GitHub,{{github|w3c/tidy-html5}} and maintained by Michael Smith, also of W3C,{{cite web | url=https://people.w3.org/mike//| title=Michael[tm] Smith | first=Michael | last=Smith | work=W3C.org | access-date=2015-02-12}} where HTML5 support was added.

In 2015, the HTML Tidy Advocacy Community Group (HTACG) was formed for management and development of HTML Tidy as a W3C Community Group.{{cite web|url=http://www.htacg.org |title=HTACG |publisher=HTACG.org |access-date=2015-04-25}}{{cite web|author=Jim Derry |url=https://www.w3.org/community/htacg/ |title=HTML Tidy Advocacy Community Group |date=15 January 2015 |publisher=W3.org |access-date=2015-04-25}}

HTML Tidy source code is written in ANSI C for portability. Compiled binary files are available for a variety of platforms. It is available under the W3C Software Notice and License, a permissive BSD-style license. Up-to-date versions are available as source code cloned from its GitHub Git version control repository, or in binary packages for multiple operating systems from its GitHub Releases repository.

Features

Examples of corrections to invalid or poorly constructed HTML:

  • Reorder mixed-up tags
  • Complete missing or mismatched end tags
  • Add missing syntax elements (some tags, quotes, etc.)
  • Report proprietary HTML extensions
  • Change layout of markup to a predefined style
  • Transform characters from some encodings into HTML entities

See also

{{Portal|Free and open-source software}}

References

{{Reflist|30em}}