gsqlcmd HTML Cleaner Options
/autoCorrectedTags=<tag>[;...]
Use this option to define HTML tags closed automatically.
For example, XHTML uses the following syntax:
<ul> <li>item1</li> <li>item2</li> </ul>
So, the <li> tag must have a closing </li> tag.
To parse HTML without the closing tag, you need to add it to the autoCorrectedTags option.
<ul> <li>item1 <li>item2 </ul>
The default value for this option contains tags: li, p, and a.
You can change the default value in the configuration file.
/cleanHtml
Use this option to clean downloaded HTML files.
You can use the clean-html mode to clean local files and find the right cleaning options.
You can customize cleaning rules with the configuration file and the following output options:
/autoCorrectedTags, /cutAttributes, /cutComments, /cutIDs, /cutTags
/cutAttributes=<attribute>[;...]
Use this option to remove specified HTML attributes in the clean-html mode or the /cleanHtml option.
For example:
/cutAttributes=data-vars-event-action,data-vars-event-label
/cutComments
Use this option to remove HTML comments in the clean-html mode or the /cleanHtml option.
/cutIDs=<id>[;...]
Use this option to remove HTML nodes with the specified id values in the clean-html mode or the /cleanHtml option.
You can use this option, for example, to remove navigation and advertisement elements.
/cutScripts
Use this option to remove HTML script and noscript tags and on* event attributes in the clean-html mode or the /cleanHtml option.
/cutStyles
Use this option to remove HTML style tags and style and class attributes in the clean-html mode or the /cleanHtml option.
/cutTags=<tag>[;...]
Use this option to remove HTML nodes with the specified tags in the clean-html mode or the /cleanHtml option.
For example:
/cutTags=amp-install-serviceworker,amp-state,amp-analytics,amp-user-notification