Package | Description |
---|---|
org.archive.wayback.util.htmllex |
Modifier and Type | Class and Description |
---|---|
class |
ContextAwareLexer
The Lexer that comes with htmlparser does not handle non-escaped HTML
entities within SCRIPT tags - by default, something like:
<script>
for(var i=0; i<23; i++) { j+=i; }
</script>
Can cause the lexer to skip over a large part of the document.
|
Copyright © 2005–2017 IIPC. All rights reserved.