What's the best regex to validate an email address?

For almost all applications, a simple pattern like /^[^\s@]+@[^\s@]+\.[^\s@]+$/ is the right choice — it confirms there's a local part, an @, and a domain with a dot, which catches the obvious typos. Anything stricter rejects valid addresses and still can't confirm the mailbox is real. The actual validation that matters is sending a confirmation email and having the user click the link.

Why shouldn't I use a strict RFC 5322 email regex?

The complete RFC 5322 grammar allows quoted strings, comments, and other rarely-used constructs, so a faithful regex is hundreds of characters long, hard to read, and frequently still rejects legitimate addresses (plus-addressing, new long TLDs, internationalized domains). Worse, naive 'strict' patterns often contain nested quantifiers that cause catastrophic backtracking (ReDoS). A simple format check plus an email confirmation is more correct and far safer.

Email Validation Regex: Patterns That Actually Work

Email validation is one of the most common tasks in software and one of the most frequently botched.

Search Stack Overflow for “email regex” and you’ll find thousands of conflicting answers. Some suggest a five-character pattern; others paste a 6,300-character monster claiming perfect RFC compliance.

The reality is that most email validation is flawed. Teams ship patterns that are too strict and block legitimate users — hurting conversion. They reject plus-addressing (user+newsletter@gmail.com), or they fail to support the Internationalized Domain Names used across Asia.

The root cause is that the governing spec, RFC 5322, is genuinely complex. It permits quoted strings, nested comments, IP address literals (user@[192.168.1.1]), and even whitespace in the local part.

This guide gives you tested, production-ready patterns you can use today. It covers the trade-off between strictness and usability, the flaws in common patterns, and why the final validation step always has to be a confirmation link.

Test your regex safely. Don’t deploy a validation string blind. Paste it into our local Regex Tester and run it against edge cases in real time.

1. The Simple Pattern (Recommended for 95% of Apps)

For most web apps, e-commerce platforms, and SaaS products, this lightweight pattern offers the best balance of accuracy, performance, and low user friction:

^[^\s@]+@[^\s@]+\.[^\s@]+$

Breaking down the syntax:

^ : start of the string.
[^\s@]+ : one or more characters that are NOT whitespace and NOT @ (the local part).
@ : the literal @.
[^\s@]+ : the domain portion.
\. : a literal dot.
[^\s@]+ : the top-level domain portion.
$ : end of the string.

Valid matches: user@example.com, john.doe+aws@sub.domain.co.uk, 名前@example.jp Rejected: user@example, user @example.com, @example.com

Why this works

The philosophy here is deliberate ignorance. It validates the shape of an email (string @ string . string) without assuming which Unicode characters the IETF will allow next year.

This is the same approach the W3C recommends for HTML5’s <input type="email" />. It accepts anything that looks structurally like an email and relies on the mail delivery system to verify existence.

2. The Strict Pattern (for Legacy Systems)

If you control the email format in a closed internal system, or you’re dealing with legacy databases that reject non-ASCII characters, you can use a stricter pattern:

^[a-zA-Z0-9._%+-]+@[a-zA-Z0-9.-]+\.[a-zA-Z]{2,}$

The constraints:

Local part (before @): letters, digits, dots, underscores, percents, pluses, and hyphens.
Domain part (after @): letters, digits, dots, and hyphens.
TLD: two or more letters.

The flaw in the strict pattern

This pattern looks professional, but it has a problem for global apps: it rejects Internationalized Domain Names (EAI / RFC 6531).

In 2012, the IETF ratified non-ASCII Unicode characters in email addresses. Addresses like user@例え.jp are valid, routable, and used by tens of millions of people in Asia. The strict pattern rejects every one of them with an “Invalid Email Format” error.

3. Regex Anti-Patterns to Avoid

Don’t copy patterns from old forums without understanding them. Here are three harmful ones still found in production.

The “3-character TLD” constraint

# ❌ Rejects modern TLDs
\.[a-z]{2,4}$

This enforces a TLD of 2–4 characters. That was roughly true before 2014, but ICANN has since launched hundreds of long gTLDs. This blocks .museum (6), .technology (10), .photography (11), and .engineer (8). Never hardcode TLD lengths.

The “no plus addressing” constraint

# ❌ Rejects Gmail/Fastmail tagging
^[a-zA-Z0-9._-]+@[a-zA-Z0-9.-]+\.[a-zA-Z]{2,}$

Note the missing + in the first bracket. Many power users rely on plus-addressing (user+newsletter@gmail.com) to create trackable aliases that route to the base inbox. Reject the + and those users abandon your signup form.

The “perfect” RFC 5322 pattern and ReDoS

There’s an infamous 6,300-character regex on Stack Overflow that claims to implement all of RFC 5322.

Don’t use it. It’s unmaintainable, and it forces the regex engine through nested lookaheads and heavy backtracking. That opens you to a ReDoS (Regular Expression Denial of Service) attack: a crafted 50-character string can make the engine run for minutes, locking up the Node.js event loop and taking down your API.

It also validates edge cases like "quoted string"@example.com and user@[192.168.1.1]. These are technically “legal” in the spec, but mail services like SendGrid, Mailgun, and AWS SES reject them anyway.

4. Length Constraints

Regex engines are poor at counting across string segments, so back up your structural check with length validation in your backend language.

The SMTP protocol (RFC 5321) defines maximum lengths:

Component	Maximum Length	RFC Reference
Total address	254 characters	RFC 5321 §4.5.3.1.3
Local part (before @)	64 characters	RFC 5321 §4.5.3.1.1
Domain part (after @)	253 characters	RFC 1035 §2.3.4

If someone submits a 5,000-character address, reject it with a length() check before it reaches the regex engine. That’s a basic security measure.

5. Implementations by Language

1. Node.js / JavaScript

Frontend validation can rely on the browser, but backend validation needs explicit checks.

// Combines length checks with the simple regex
function validateEmail(emailInput) {
  // 1. Normalize
  const email = emailInput.trim().toLowerCase();
  
  // 2. Reject oversized input to prevent ReDoS CPU exhaustion
  if (email.length > 254) {
    return { valid: false, reason: "Exceeds the RFC length limit." };
  }
  
  const [localPart, domainPart] = email.split('@');
  
  // 3. Check both parts exist
  if (!localPart || !domainPart) {
    return { valid: false, reason: "Missing local part or domain." };
  }
  
  // 4. Length constraints
  if (localPart.length > 64) {
    return { valid: false, reason: "Local part exceeds 64 characters." };
  }
  
  // 5. Structural regex (the simple pattern)
  const pattern = /^[^\s@]+@[^\s@]+\.[^\s@]+$/;
  if (!pattern.test(email)) {
    return { valid: false, reason: "Fails the structural regex check." };
  }
  
  return { valid: true, normalized: email };
}

2. Python (FastAPI / Django)

Don’t rely on the base re module alone for boundary checks. Use a well-tested validator or a framework like pydantic (which powers FastAPI).

import re

def is_valid_email(email_string: str) -> bool:
    """A production-ready email validator."""
    email_string = email_string.strip().lower()
    
    # 1. Guard against ReDoS
    if len(email_string) > 254:
        return False
        
    # 2. Structural split
    parts = email_string.split('@')
    if len(parts) != 2:
        return False
        
    local_part, domain_part = parts
    if len(local_part) > 64 or len(local_part) == 0:
        return False
        
    # 3. Simple structural regex
    pattern = r'^[^\s@]+@[^\s@]+\.[^\s@]+$'
    return bool(re.match(pattern, email_string))

3. HTML5 (no JavaScript)

Browsers have validation engines built into the DOM. You don’t need a JavaScript library to validate an email on the frontend.

<form>
  <!-- The browser enforces W3C email validation natively -->
  <input type="email" name="user_email" required maxlength="254" />
  <button type="submit">Register Account</button>
</form>

With type="email", the browser blocks submission, outlines the input in red, and shows a localized error (“Please enter an email address”) for free. Use it as your first line of defense, backed by server-side logic.

6. The Golden Rule: Closed-Loop Verification

Regex validates the format; it doesn’t validate existence.

Someone can submit perfectly.valid.format@this-domain-is-fake.com. It passes the W3C spec, the HTML5 spec, and the most complex RFC 5322 regex ever written.

The only way to truly verify an email is closed-loop verification:

Accept the address with a simple, permissive regex on the backend.
Generate a secure, time-limited token.
Send a verification link with that token via AWS SES or SendGrid.
If the user clicks it, you have proof that the email exists, the domain works, and the user controls the inbox.

Treat regex as a typo-catcher, not a security boundary.

Recent Activity

Email Validation Regex: Patterns That Actually Work

Email Validation Regex: Patterns That Actually Work

1. The Simple Pattern (Recommended for 95% of Apps)

Why this works

2. The Strict Pattern (for Legacy Systems)

The flaw in the strict pattern

3. Regex Anti-Patterns to Avoid

The “3-character TLD” constraint

The “no plus addressing” constraint

The “perfect” RFC 5322 pattern and ReDoS

4. Length Constraints

5. Implementations by Language

1. Node.js / JavaScript

2. Python (FastAPI / Django)

3. HTML5 (no JavaScript)

6. The Golden Rule: Closed-Loop Verification

Further Reading

Related Tools — Try Them Now

Related Articles

Phone Number Regex Patterns by Country

Regex Cheat Sheet: Quick Reference for Developers

Regular Expressions: The Complete Guide