getXMLErrors               package:XML               R Documentation

_G_e_t _X_M_L/_H_T_M_L _d_o_c_u_m_e_n_t _p_a_r_s_e _e_r_r_o_r_s

_D_e_s_c_r_i_p_t_i_o_n:

     This function is intended to be a convenience for  finding all the
     errors in an XML or HTML document due to being malformed, i.e.
     missing quotes on attributes, non-terminated elements/nodes,
     incorrectly terminated nodes, missing entities, etc. The document
     is parsed and a list of the errors is returned along with
     information about the file, line and column number.

_U_s_a_g_e:

     getXMLErrors(filename, parse = xmlInternalTreeParse, ...)

_A_r_g_u_m_e_n_t_s:

filename: the identifier for the  document to be parsed, one of a local
          file name,  a URL or the XML/HTML content itself

   parse: the function to use to parse the document, usually either 
          'xmlTreeParse' or  'htmlTreeParse'. 

     ...: additional arguments passed to the function given by 'parse'

_V_a_l_u_e:

     A list of S3-style  'XMLError' objects.

_A_u_t_h_o_r(_s):

     Duncan Temple Lang

_R_e_f_e_r_e_n_c_e_s:

     libxml2 (<URL: http://xmlsoft.org>)

_S_e_e _A_l_s_o:

     'error' argument for 'xmlTreeParse' and related functions.

_E_x_a_m_p_l_e_s:

          # Get the "errors" in the HTML that was generated from this Rd file
       getXMLErrors(system.file("html", "getXMLErrors.html", package = "XML"))

     ## Not run: 
       getXMLErrors("http://www.omegahat.org/index.html")
     ## End(Not run)

