CP-1252 Fixer

About

You work with the internet.
Your job involves copying and pasting content from Microsoft Word.
Funny characters like ’, ‘ and âˆÀˆÙ show up on your webpages.

This is a small program which runs in your system tray and monitors your clipboard for the presence of "smart quotes" and other characters normal HTML documents may have problems with. These are automatically fixed without any action on the user's part, leaving you able to go about your work error-free.

Usage

Simply copy CP1252Fixer.exe somewhere appropriate and run it - its icon should now be visible in your system tray.

Right-click the system tray icon to set the various replacement options or toggle replacements entirely. Left clicking the icon shows the main window, with help tooltips for each option and a display of the current clipboard text.

Download

CP1252Fixer.exe (112KB)
A mirror (and full source code) is also available on Github.

Features

Explanation

Microsoft Word, Outlook, Powerpoint and some other Windows programs use a feature called "smart quotes", which is designed to automatically substitute different characters for keys on your keyboard as you type. This is in an attempt to promote readability via the use of typographer's quotes and punctuation marks. As a result:

When you type... this appears:
'text ‘text
text' text’
"text “text
text" text”
- text • text
--
-
...

When pasted into a normal HTML document encoded as ASCII text, the character set of the document doesn't know how to display these characters. Instead, they are spit out as the combination of ASCII letters which make up the longer number of the extended character, resulting in broken text. For example, ‘ is 0xE28098:

           0xE28098
         = 0xE2 0x80 0x98
in ASCII:  â    €    ˜

These problems can be avoided by:

However even then, you should keep your text portable! Anyone copying these characters off your webpage onto a Linux or Mac computer will not be able to read them reliably, as they are Windows - only characters and have no equivalents or goddamn place elsewhere in the digital world. Seriously. Even Microsoft's own file formats encode them as escape sequences. Guys.

License

This software is provided under an MIT open source license, read the LICENSE.txt file for details.

Known Issues