xml handling in php4

given xml input as a string with a xml declaration that specifies the encoding, what can i do to always get back utf-8 data in my character data handler?

that is, how do i get "i ♥ são paulo" as utf-8 given either of these two inputs:

<?xml version="1.0" encoding="iso-8859-1"?>
<foo>i &#9829; são paulo</foo>

<?xml version="1.0"?>
<foo>i &#9829; s&#227;o paulo</foo>

handling the second case is easy, it is the first case that i can't figure out.

and someone deserves a kick in the teeth for making php4's xml parser default to iso-8859-1.

comments

Have you tried:

xml_parser_set_option($parser, XML_OPTION_TARGET_ENCODING, 'UTF-8')

-adam

» Adam Trachtenberg (link) » july 25, 2003 11:30pm

you may also specify an encoding to xml_parser_create() which should work.

» Sterling Hughes (link) » july 26, 2003 5:00am

S

» Joao Prado Maia (link) » july 28, 2003 10:25am

add a comment

sorry, comments on this post are closed.