Microsoft Word generates clean HTML for blogs?

Awesome. One Microsoft team heard my pleas for clean XHTML. By the way, the new Word 2007 has the ability to post to blogs built in. Joe Friend has been writing about it.

Lots of Microsoft program managers push back and say "normal people don't care about HTML quality."

That might be true (although we all hate it when our pages don't display right in all browsers, or when they are slow to load), but the influentials who write reviews and tell their friends (or set up their computers) do care about such things.

One guy told me "but we have 10s of millions of users already, so why should we care?"

Well, you woulda had 10,000,001 if you had clean HTML. :-)

84 thoughts on “Microsoft Word generates clean HTML for blogs?”

  1. Copy paste – even using styles and the new blogformat option is all messed up in Word 2007.

    I can take any Word 97-2003 document and copy and paste it directly to the web (discussion boards, etc.), and all formatting is preserved without stupid [FONT] tags inserted all over the pasted text.

    But try doing that in Word 2007! It just doesn’t work. About the best I can get from it is letting 2007 remove all the work I put into formatting the text, and then pasting it.

    What a pain!

    How can I get the simple copy-paste-that-works feature of Word 97-2003?

    Bob

  3. I wasted a week, first trying to convert a Word 2007 document (.docx) or Word “Compatibility Mode” (.doc) with images to a webpage by saving it as .htm, or by writing the document as HTML and saving it as .htm. All I got was HTML text on the webpage! Then my son told me Word-to-HTML sucks, just write the HTML in Wordpad, save as .txt, FTP it to the server and rename it there to .htm. Works fine.

  5. Well, I tried that feature and I was amazed. If they can get it right, can’t they at least introduce a new save-as feature? Like “save-as-blog-html” so at least we can use that for those blogs that Word does not support.
    BTW, it seems they’ve ported that to (or maybe it was ported from) Microsoft Windows Live Writer. Now, I think this is a very cool tool. Works nicely with WordPress after some minimal changes to web config. I’ve documented it in my blog though.

  7. I just posted about Word 2007 as a blog writing tool. The content is in Thai, though. From a blog writer's perspective, it is not as good as Writer. It still has some problems uploading pictures to WordPress, problems with categories, no additional support for tags, etc. But as a simple blog writing tool, I think it is quite good. No need to learn anything new.

  9. For me, this is HUGE. A big part of my job is converting your average Word-produced business documents into something our intranet application won’t choke on. And it always chokes on the invalid markup generated by Word. Even the “Save As Web” command produces something so far from valid XHTML it’s scary. That’s before you deal with the nested divs & font tags…ugh… nightmarish. Especially when business owners don’t understand why you can’t just copy and paste from Word.

    For a while I was using the Textism Word Cleaner tool, and that worked well. Looking forward to this new feature… hopefully they’ll extend it to the rest of the Office suite (at least the ones that support “Save as web page”).

  11. Couldn’t agree more. I use Word for my daily work (I’m primarily a writer) but if I cut and paste text from Word into my blog (http://www.badlanguage.net) or my flying website (http://www.modernpilot.com) it comes with a complete set of Microsoft-specific tags that I don’t want or need. The easiest way to wash them out is – get this – to paste the text into Notepad and then copy it from there. How low-tech is that? It’s encouraging to know that there’s someone inside Microsoft who understands this daily frustration.

  13. Will it ever be likely that common open source blogging tools like WordPress will be supported?

  15. This is good news but which version(s) of XHTML will Word render – 1.0 transitional, 1.1 strict or 2.0?

    I for one think this is the strongest reason to upgrade my version of Outlook and Word so long as the metaweblog API and Atom publication support allow me to blog to a variety of blogging tools and not just Spaces.

    I wonder if Microsoft could go further and fully support CSS for template formatting in Word and JavaScript for macros.

  17. You can get an amen, Darren. I’ll even throw in an Alleluia. I cannot count the hours I have wasted dorking around with terribly formatted Word output.

  19. Pingback: Kingsley 2.0

  20. Regarding post #24 – pretty good analysis!

    I just checked Windows Media Player version 10, and it seems to allow MP3 up to 320 Kbps. The max for Windows Media Audio is 192 Kbps. There is a lossless option that goes up to 940 Kbps.

    I happen to believe that you are right about previous versions. So hopefully other things will continue in this direction (of competitive software).

Comments are closed.