[LON-CAPA-dev] HTML to XHTML

Gerd Kortemeyer lon-capa-dev@mail.lon-capa.org
Mon, 26 Aug 2002 17:43:36 -0400


This is a multi-part message in MIME format.
--------------83FCFF4947E36C0C68C4AE52
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit

I saw this (see below) float by on the modperl mailing list. One of these days
I need to try out how good this might be for autoconversion "cleanup" of HTML
to XML:

My suggestion would to just use a XML parser module like XML::LibXML.
Load the file up using the HTML loading functions and print it using the
XML printing functions ... since the only difference I can see between
HTML and XHMTL is that optional ending tags are no longer optional (per
XML spec) and single tags must be ended properly (per XML spec).

--------------83FCFF4947E36C0C68C4AE52
Content-Type: text/x-vcard; charset=us-ascii;
 name="korte.vcf"
Content-Transfer-Encoding: 7bit
Content-Description: Card for Gerd Kortemeyer
Content-Disposition: attachment;
 filename="korte.vcf"

begin:vcard 
n:Kortemeyer;Gerd
tel;fax:(517) 432-2175
tel;work:(517) 432-5468
x-mozilla-html:FALSE
url:http://www.lite.msu.edu/kortemeyer/
org:LITE Lab;DSME MSU
version:2.1
email;internet:korte@lite.msu.edu
title:Instructional Technology Specialist
adr;quoted-printable:;;123 North Kedzie Labs=0D=0AMichigan State University;East Lansing;MI;48824;USA
fn:Gerd Kortemeyer
end:vcard

--------------83FCFF4947E36C0C68C4AE52--