Ampersands and XHTML

May 25, 2008 by Michael

A regular expression that replaces every ampersand (&) in a text with &amp;, unless the ampersand is already part of an entity:

t = t.gsub(/&(?!#?\w+;)/, '&amp;')

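The language is Ruby. The regexp feature used is called a negative lookahead: the (?!#?\w+;) group lets the & match only when it is not followed by an optional #, one or more word characters and a semicolon, so named entities and numeric ones like &#169; are left alone.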
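To see the lookahead at work, here is a minimal sketch that wraps the expression in a small method and runs it on a mixed string. The method name escape_bare_ampersands is made up for this illustration, and the sketch assumes the intended replacement is &amp;.

```ruby
# Hypothetical helper (name invented for this example); the regexp is the
# one from the post, assuming the replacement text is meant to be "&amp;".
def escape_bare_ampersands(text)
  # (?!#?\w+;) is a negative lookahead: the "&" only matches when it is NOT
  # followed by an optional "#", one or more word characters, and a ";".
  text.gsub(/&(?!#?\w+;)/, '&amp;')
end

puts escape_bare_ampersands('Tom & Jerry &amp; friends &#169; 2008')
# prints: Tom &amp; Jerry &amp; friends &#169; 2008
```

Note that the lookahead only checks the shape of what follows, not whether the entity actually exists, so text such as "AT&T;" would be left untouched as well.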
