Topic: Word text to HTML |
|
---|---|
Author | Thread |
Paranoid (IV) Inmate From: Mpls, MN |
posted 06-13-2007 21:07
Ok, I've got a stupid question. |
Paranoid (IV) Inmate From: Norway |
posted 06-13-2007 21:14
HTML Tidy for the win! |
Paranoid (IV) Inmate From: Mpls, MN |
posted 06-13-2007 21:22
Thanks, |
Paranoid (IV) Inmate From: Norway |
posted 06-13-2007 21:38
Last time I tried, i.e. years ago, Dreamweaver did a good job of removing the crap in Word's markup. But HTML Tidy is supposed to do it very well too. It's worth googling for examples of how to do it. |
Paranoid (IV) Inmate From: Florida |
posted 06-14-2007 02:38
You just need some sed(/ish) scripts, or an editor that does macros, etc. If Word can save as HTML and the only problem is that decent HTML is surrounded by tons of crap HTML, it might be best to go that route - save as HTML from Word, then run it through a script/macros that clean out the crap. |
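For anyone who wants to try the script route, here is a rough Python sketch of that kind of sed-ish clean-up. The pattern list is my own guess at typical Word "save as HTML" clutter (conditional comments, `o:p` tags, `Mso` classes, inline styles, spans); it is not exhaustive, and regexes are a blunt tool for HTML, so check the output by eye.

```python
import re

# Assumed, incomplete list of Word-export clutter; adjust for your documents.
WORD_CRUFT = [
    re.compile(r'<!--\[if.*?<!\[endif\]-->', re.S | re.I),  # conditional comments
    re.compile(r'</?o:p\b[^>]*>', re.I),                     # Office namespace tags
    re.compile(r'\s+class=("Mso[^"]*"|Mso\w+)', re.I),       # MsoNormal and friends
    re.compile(r'\s+style="[^"]*"', re.I),                   # double-quoted inline styles
    re.compile(r'</?span\b[^>]*>', re.I),                    # span wrappers
]

def strip_word_cruft(html):
    """Apply each pattern in turn and delete whatever it matches."""
    for pattern in WORD_CRUFT:
        html = pattern.sub('', html)
    return html

if __name__ == '__main__':
    sample = '<p class=MsoNormal style="margin:0in"><span lang=EN-US>Hi<o:p></o:p></span></p>'
    print(strip_word_cruft(sample))
```

The order matters a little: the conditional comments go first so that anything nested inside them disappears in one pass.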
Paranoid (IV) Inmate From: Mpls, MN |
posted 06-14-2007 03:41
I've got Tidy working perfectly now. I found that if I export the doc as HTML from Word and then run it through HTML Tidy twice, I get exactly what I was after. WeBuilder 2007 has a very nice front end to Tidy, but TidyGui would do the trick as well. |
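If you would rather script the two passes than click through a front end, something like the following Python sketch might do. It assumes the `tidy` command-line tool is on your PATH, and the option set (`word-2000`, `bare`, `clean`) is my assumption about what suits Word exports, not necessarily what was used above. The function names are made up for illustration.

```python
import subprocess

def tidy_command(path):
    """Build a tidy command line that rewrites `path` in place."""
    return [
        "tidy", "-quiet", "-modify", "-asxhtml",
        "--word-2000", "yes",   # strip Word 2000 specific markup
        "--bare", "yes",        # drop Microsoft-specific HTML
        "--clean", "yes",       # replace presentational tags with CSS
        path,
    ]

def tidy_passes(path, passes=2):
    """Run tidy over the same file `passes` times (twice by default)."""
    for _ in range(passes):
        result = subprocess.run(tidy_command(path))
        # tidy exits with 0 for success, 1 for warnings, 2 for errors.
        if result.returncode >= 2:
            raise RuntimeError("tidy reported errors for " + path)

if __name__ == "__main__":
    # "exported.html" is a placeholder file name; nothing runs until you call tidy_passes.
    print(" ".join(tidy_command("exported.html")))
```

Work on a copy of the exported file, since `-modify` overwrites it.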
Paranoid (IV) Inmate From: Florida |
posted 06-14-2007 22:08
Twice, ha. Never tried that; neat. |
Obsessive-Compulsive (I) Inmate From: |
posted 07-04-2007 20:43
I wouldn't touch a software package with a ten-foot pole for this. But coming from a Search Engine Optimization background, I like my content to carry as little markup as possible. If I am using a WYSIWYG editor, I copy and paste into Textpad first, then build the page. If it's Dreamweaver, I paste right into the source view and wrap it in tags as I go. I find it really doesn't take that much time, with the exception of tabular data. Cheers. |
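The paste-as-plain-text-then-wrap approach can also be roughed out in a few lines of Python. This is only a sketch under my own assumptions: paragraphs are separated by blank lines, and every paragraph becomes a bare `<p>`. Headings, lists and tabular data would still need hand work.

```python
import re
from html import escape

def paragraphs_to_html(text):
    """Wrap each blank-line-separated block of plain text in <p> tags."""
    blocks = re.split(r"\n\s*\n", text.strip())
    lines = []
    for block in blocks:
        # Collapse line breaks and runs of spaces inside a paragraph.
        flat = " ".join(block.split())
        if flat:
            lines.append("<p>" + escape(flat) + "</p>")
    return "\n".join(lines)

if __name__ == "__main__":
    print(paragraphs_to_html("First paragraph,\nstill the first.\n\nSecond & last."))
```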