Email Address Pattern Matching

Fyfield, Derek DEREK.FYFIELD at orbit.net
Thu Aug 31 09:52:08 CDT 2006


This is what I used for bulk review of some old data with QTP.
"(^|#)@!@(^|#)@.(^|#)@"   

 

The pattern looks for :-

1.               (^|#)        an initial alpha or numeric character

2.               @                zero or any alphanumerics and special
characters

3.               (^|#)@   another set of characters beginning with at
least one alphanumeric

4.               .        a full stop

5.               (^|#)@   and a final set of characters beginning with
an alphanumeric. 

 

The patttern may let in some strange characters like { or %, but it did
clean up lots of rubbish.  

 

A look at this link suggests the above is not complete!
http://en.wikipedia.org/wiki/E-mail_address

 

 

Derek Fyfield

 

Orbit Bahrain

 

________________________________

From: powerh-l-bounces+derek.fyfield=orbit.net at lists.sowder.com
[mailto:powerh-l-bounces+derek.fyfield=orbit.net at lists.sowder.com] On
Behalf Of Gordon, Kevin
Sent: Thursday, 31 August 2006 4:47 PM
To: Powerhouse
Subject: RE: Email Address Pattern Matching

 

Here's what I came up with last time I did this:

"(^|#|!_|!-|!.)>!@((^|#|!_|!-)>!.)>^>"

I haven't reanalysed it but it did the job at the time!

 

Kevin Gordon
Sea Containers Information Services

 

________________________________

From: powerh-l-bounces+kevin.gordon=seacontainers.com at lists.sowder.com
[mailto:powerh-l-bounces+kevin.gordon=seacontainers.com at lists.sowder.com
] On Behalf Of Jim Hamer
Sent: 31 August 2006 14:23
To: Powerhouse
Subject: Email Address Pattern Matching

Greeting Listers,

 

I'm trying to develop a pattern match for a generic e-mail address to
put in the Pattern for the e_mail_address element in the dictionary.
Here's what I have so far:

 

"(^|!.|#)*!@(^|#)*!.(^|#)*"

 

That is, (any character or decimal or digit) repeating and then the
character @ and then (any character or digit) repeating and then the
character . and then (any character or digit) repeating.

 

I think this will do but I wonder if there is a more comprehensive
pattern that I should be using. My purpose is to catch typos by the
office staff (spaces, for example).

 

Thanks,

Jim

 

Jim Hamer

 

jhamer at milk.org

(905) 821-8970 ext 233

 

jim.hamer at hamerdata.com

(416) 489-3298

 

www.hamerdata.com

 

 


______________________________________________________________________
This email has been scanned by the MessageLabs Email Security System.
______________________________________________________________________

This email has been scanned by the MessageLabs Email Security System.
______________________________________________________________________


This message and any attachments (the 'message') is intended solely for the
addressees and is confidential. If you receive this message in error, please
delete it and immediately notify the sender. Any use not in accord with its
purpose, any dissemination or disclosure, either whole or partial, is
prohibited, except with formal approval. The Internet cannot guarantee the
integrity of this message. Orbit (and its subsidiaries) shall therefore not
be liable for the message, if modified. Orbit accepts no liability for any
damage caused by any error or virus transmitted by this email.



-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.sowder.com/pipermail/powerh-l/attachments/20060831/b804637b/attachment.html


More information about the powerh-l mailing list