<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Gabriel de Kadt &#187; Word 2004</title>
	<atom:link href="http://www.lazydada.com/tag/word-2004/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.lazydada.com</link>
	<description>Personal notes on Mac based web development and design.</description>
	<lastBuildDate>Tue, 30 Aug 2011 10:47:39 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Clean, formatting-free XHTML from Word for posting into blogs and CMSs?</title>
		<link>http://www.lazydada.com/2008-06-05/clean-formatting-free-xhtml-from-word-for-posting-into-blogs-and-cmss/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=clean-formatting-free-xhtml-from-word-for-posting-into-blogs-and-cmss</link>
		<comments>http://www.lazydada.com/2008-06-05/clean-formatting-free-xhtml-from-word-for-posting-into-blogs-and-cmss/#comments</comments>
		<pubDate>Thu, 05 Jun 2008 17:04:43 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Technology]]></category>
		<category><![CDATA[Web design & development]]></category>
		<category><![CDATA[MacOSX]]></category>
		<category><![CDATA[webdev]]></category>
		<category><![CDATA[Word 2004]]></category>
		<category><![CDATA[writing]]></category>
		<category><![CDATA[XHTML]]></category>

		<guid isPermaLink="false">http://www.lazydada.com/?p=39</guid>
		<description><![CDATA[&#8212;&#8212;&#8211; Update 2 Thanks to this great list of open-source apps for Mac OS X there another option to look at: AbiWord. I&#8217;ve not tried it yet but it looks set to beat my non-starting attempts with OpenOffice (office Mac is PowerPC without X11). Even better &#8211; it seems to state that it can do the [...]]]></description>
			<content:encoded><![CDATA[<p>&#8212;&#8212;&#8211;</p>
<h3>Update 2</h3>
<p>Thanks to this <a title="Seven open-source Mac apps you need right now" href="http://www.macworld.co.uk/macsoftware/news/index.cfm?RSS&amp;NewsID=21662" target="_blank">great list of open-source apps for Mac OS X</a> there another option to look at: <a title="Open-source word processor for Mac OS X" href="http://www.abisource.com" target="_blank">AbiWord</a>. I&#8217;ve not tried it yet but it looks set to beat my non-starting attempts with OpenOffice (office Mac is PowerPC without X11). Even better &#8211; it seems to state that <a title="Authoring Web-Clean AbiWord Documents" href="http://www.abisource.com/help/en-US/howto/howtoweb.html" target="_blank">it can do the job</a>. TBC.</p>
<p>&#8212;&#8212;&#8211;</p>
<h3>Update</h3>
<h2>What a difference a day makes</h2>
<p>Despite yesterday&#8217;s test proving otherwise &#8211; today it seems that I <strong>can</strong> use Paste Special in to GoLive and keep all formatting. Today I can &#8220;Paste As&#8221; and chose &#8220;Cleared HTML (Removes exotic Markup)&#8221; over the limited &#8220;HTML&#8221; option which was all I could do yesterday. Just a few extra non-breaking spaces and p tags for the line breaks &#8211; but otherwise perfect unicode for the web.</p>
<p>I think that I may have not been pasting directly from word. Not sure. Anyway this is now the best solution for when I&#8217;m about to help update content.<br />
&#8212;&#8212;&#8211;<br />
[Posted to the Microsoft Word forum <a href="http://www.officeformac.com/3192">here</a>]</p>
<p>I&#8217;ve been searching for a while now and have found no simple solution for this issue. I&#8217;m working to set up a CMS (Drupal in this case) and want to find a way to enable the writers &#8211; using Word 2004 &#8211; to upload their own content, properly styled in clean XHTML.</p>
<p>I want to avoid any extra steps as more steps leads to more chances for errors to creep in. The only formatting needed is semantic content; just HTML body content without extraneous Word Roundtrip information or formatting at all as all design should be defined by using CSS stylesheets.</p>
<p>I just want the basic stuff i.e. h1-h6 headings (defined at the authoring stage, using Word&#8217;s standard styles), bold, italics and quality typography (all accents, &#8220;curly&#8221; quotes and em-dashes) properly encoded into human readable XHTML entities (ie &#8220;&amp;&#8221; becomes &#8220;&amp;&#8221;).</p>
<p>I&#8217;m worried that I&#8217;m going to have to compromise on quality or make, what to my mind should be basic functionality, a laborious and error-prone process&#8230;</p>
<p>Does anybody have a solution? Is this doable by hacking/editing the &#8220;Word Conversion Options&#8221; or &#8220;com.microsoft.Word.prefs.plist&#8221; files?</p>
<p>[The basic structure: headings and paragraphs; bold; italics and accents (as unicode) can be handled by the CMS's interface thanks to TinyMCE and its Paste From Word function - but this cannot handle typographic features such as proper curly quotes and em-dashes.]</p>
]]></content:encoded>
			<wfw:commentRss>http://www.lazydada.com/2008-06-05/clean-formatting-free-xhtml-from-word-for-posting-into-blogs-and-cmss/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
<!-- WP Super Cache is installed but broken. The path to wp-cache-phase1.php in wp-content/advanced-cache.php must be fixed! -->
