MXNA 2.0 and Character Encoding

Supporting so many different languages with MXNA 2.0 has forced me to learn a
great deal about character encoding, which I’m very thankful for. I have been
writing US/English-centric applications for too long, and I’m very happy to now
be supporting the international community. We are now aggregating 55 non-English
feeds in 16 different languages, and I have about a dozen more in the queue
waiting to be approved. I’d say we’re adding non-English feeds at a rate of about
25 per week, at this point.

This level of support has not been easy. Unfortunately, parsing, storing, and
displaying information in any human language takes a fair amount of work, even with
programming languages like ColdFusion and Java. I have fixed several encoding-related
bugs over the last few days, so if you have seen character encoding problems, please
go back and try again. Specifically, I have fixed the following:

  • When servers do not report a specific character encoding, or report the wrong
    encoding, MXNA can still parse the feed.
  • The character encoding for the generated RSS feeds and OPML files should now
    be correct (UTF-8), and all characters should be rendered properly.
  • The character encoding for MXNA is now correct, and all characters should be
    rendered properly.
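
To make the first fix concrete, here is a minimal Java sketch of one common way to choose an encoding for a feed when the server's header is missing or wrong. This is an illustration under assumptions, not the actual MXNA code: the class name FeedEncodingSniffer, the detect method, and the 200-byte look-ahead are all hypothetical. The idea is to trust the bytes first (byte order mark, then the XML declaration), use the HTTP header only as a hint, and fall back to UTF-8.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not MXNA's implementation.
public class FeedEncodingSniffer {
    // Matches encoding="..." inside an XML declaration such as
    // <?xml version="1.0" encoding="ISO-8859-1"?>
    private static final Pattern XML_DECL_ENCODING = Pattern.compile(
        "<\\?xml[^>]*encoding\\s*=\\s*[\"']([A-Za-z0-9._-]+)[\"']");

    // body: raw feed bytes; headerCharset: charset from the HTTP
    // Content-Type header, or null if the server did not report one.
    public static Charset detect(byte[] body, String headerCharset) {
        // 1. A byte order mark is the strongest signal.
        if (body.length >= 3 && (body[0] & 0xFF) == 0xEF
                && (body[1] & 0xFF) == 0xBB && (body[2] & 0xFF) == 0xBF) {
            return StandardCharsets.UTF_8;
        }
        if (body.length >= 2 && (body[0] & 0xFF) == 0xFE && (body[1] & 0xFF) == 0xFF) {
            return StandardCharsets.UTF_16BE;
        }
        if (body.length >= 2 && (body[0] & 0xFF) == 0xFF && (body[1] & 0xFF) == 0xFE) {
            return StandardCharsets.UTF_16LE;
        }

        // 2. Read the start of the document as ISO-8859-1 (every byte maps
        //    to a character) and look for the declared encoding.
        int headLength = Math.min(body.length, 200);
        String head = new String(body, 0, headLength, StandardCharsets.ISO_8859_1);
        Matcher m = XML_DECL_ENCODING.matcher(head);
        if (m.find()) {
            Charset declared = lookup(m.group(1));
            if (declared != null) {
                return declared;
            }
        }

        // 3. Fall back to the server's header, then to UTF-8.
        Charset fromHeader = lookup(headerCharset);
        return fromHeader != null ? fromHeader : StandardCharsets.UTF_8;
    }

    // Returns null for missing, malformed, or unsupported charset names.
    private static Charset lookup(String name) {
        if (name == null || name.trim().isEmpty()) {
            return null;
        }
        try {
            return Charset.forName(name.trim());
        } catch (IllegalArgumentException e) {
            return null;
        }
    }
}
```

Once a charset is chosen, the bytes can be decoded with new String(body, charset) and handed to the XML parser as characters. Preferring the document's own declaration over the header is a design choice in this sketch; it helps when a server mislabels a feed, but it will not rescue a feed whose declaration is itself wrong.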

Everything in MXNA 2.0 should be encoded as UTF-8, and all languages and characters
should be supported with no exceptions. If you see any encoding issues, please let
me know so that we can continue to provide the international community with the
best support possible.
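
For readers generating their own feeds, the usual cause of a feed that claims UTF-8 but renders incorrectly is a mismatch between the declared encoding and the bytes actually written. The following Java sketch shows the idea of keeping the two in step. It is an assumption-laden illustration, not how MXNA generates its feeds: the Utf8FeedWriter class and its render method are hypothetical, and the feed is reduced to a single title element.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not MXNA's implementation.
public class Utf8FeedWriter {
    // Builds a tiny RSS document and encodes it explicitly as UTF-8,
    // so the bytes always agree with the XML declaration.
    public static byte[] render(String title) {
        String xml = "<?xml version=\"1.0\" encoding=\"UTF-8\"?>\n"
            + "<rss version=\"2.0\"><channel><title>"
            + escape(title)
            + "</title></channel></rss>";
        // Never rely on the platform default charset here.
        return xml.getBytes(StandardCharsets.UTF_8);
    }

    // Escapes the characters that are special in XML text content.
    static String escape(String s) {
        return s.replace("&", "&amp;")
                .replace("<", "&lt;")
                .replace(">", "&gt;");
    }
}
```

When serving these bytes over HTTP, the Content-Type header should also name the same charset, for example "application/rss+xml; charset=utf-8".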

Oh, and if you happen to see some non-English posts that you don’t think are appropriate
for MXNA, please let me know. We’re counting on the community to help police
and approve content.

5 Responses to MXNA 2.0 and Character Encoding

  1. Hi All, Thanks for Unicode support on new MXNA. Now we have our names and entries in the right format :) Hey Ray, are you there? We also need Unicode RSS support on BlogCFC. :) I just made my own, but I think it would be a good idea to have Unicode RSS support on BlogCFC. Thanks again for everyone here …

  2. JesterXL says:

    Pimp, I just got the mobile version on my phone. However, upon clicking the links, it goes to a normal webpage which wasn’t made to display in a phone. Is there a way to register a feed made for mobile so it goes to a mobile formatted page? What options do I have as both a content provider (my blog), and using my phone to read pages?

  3. PaulH says:

    the generate RSS method already is unicode capable (the whole blog is basically). sure it’s not an RSS client or aggregator? i know many of these aren’t capable of handling unicode.

  4. PaulH says:

    christian, i think it would really be useful if you expanded on how you handled un-hinted RSS feeds, the encoding mistakes you made, etc.

  5. Kola says:

    I agree with PaulH, an article on what you learned would be of great value to the community