class Nokogiri::HTML4::SAX::Parser
💡 This class is an alias for Nokogiri::HTML4::SAX::Parser as of v1.12.0.
This parser is a SAX style parser that reads its input as it deems necessary. The parser takes a Nokogiri::XML::SAX::Document, an optional encoding, then given an HTML input, sends messages to the Nokogiri::XML::SAX::Document.
âš This is an HTML4 parser and so may not support some HTML5 features and behaviors.
Here is a basic usage example:
class MyHandler < Nokogiri::XML::SAX::Document def start_element name, attributes = [] puts "found a #{name}" end end parser = Nokogiri::HTML4::SAX::Parser.new(MyHandler.new) # Hand an IO object to the parser, which will read the HTML from the IO. File.open(path_to_html) do |f| parser.parse(f) end
For more information on SAX parsers, see Nokogiri::XML::SAX or the parent class Nokogiri::XML::SAX::Parser.
Also see Nokogiri::XML::SAX::Document for the available events.