Question 1

What is a regular expression (regex)?

Accepted Answer

A regular expression is a sequence of characters that defines a search pattern. You can use regex to find, validate, extract, or replace text — for example, validating that an email address has the right format, or extracting all phone numbers from a document.

Question 2

How do I learn regex?

Accepted Answer

Start with the basics: literal characters match themselves, `.` matches any single character (except a line break, by default), `\d` matches a digit, and `*` means zero or more of the preceding item. Practice with a tool like this tester, then try regexr.com or regular-expressions.info for deeper tutorials.
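As a minimal sketch of the basics listed above, the JavaScript snippet below tries each building block once. The sample strings and variable names are made up for illustration.

```javascript
// Literal characters match themselves.
const literal = /cat/.test("concatenate");

// `.` matches any single character (a line break is excluded by default).
const dot = /c.t/.test("cut");

// `\d` matches one digit; `*` repeats the preceding item zero or more times,
// so `\d\d*` means "one digit followed by any number of further digits".
const digits = "order 12345 shipped".match(/\d\d*/)[0];

// Because `*` allows zero repetitions, `ab*c` also matches "ac".
const zeroRepeats = /ab*c/.test("ac");

console.log(literal, dot, digits, zeroRepeats); // true true 12345 true
```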

Question 3

What are the most common regex patterns?

Accepted Answer

Email: `/^[\w.-]+@[\w.-]+\.\w{2,}$/` · Phone (US): `/\(?\d{3}\)?[-. ]?\d{3}[-. ]?\d{4}/` · URL: `/https?:\/\/[\w.-]+(?:\/[\w.-]*)*/` · Date (YYYY-MM-DD): `/\d{4}-\d{2}-\d{2}/` · ZIP code: `/\d{5}(-\d{4})?/`
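A short JavaScript sketch of how these patterns might be tried out follows. The phone and URL patterns are written with escaped parentheses and slashes so they are valid regex literals; the sample inputs and the `patterns`/`samples` names are illustrative assumptions.

```javascript
// The five patterns from the answer, as JavaScript regex literals.
const patterns = {
  email: /^[\w.-]+@[\w.-]+\.\w{2,}$/,
  phoneUS: /\(?\d{3}\)?[-. ]?\d{3}[-. ]?\d{4}/,
  url: /https?:\/\/[\w.-]+(?:\/[\w.-]*)*/,
  date: /\d{4}-\d{2}-\d{2}/,
  zip: /\d{5}(-\d{4})?/,
};

// Made-up sample inputs, one per pattern.
const samples = {
  email: "jane.doe@example.com",
  phoneUS: "(555) 123-4567",
  url: "https://example.com/docs/page-1",
  date: "2024-02-29",
  zip: "12345-6789",
};

// Test each sample against the pattern of the same name.
const results = Object.fromEntries(
  Object.entries(patterns).map(([name, re]) => [name, re.test(samples[name])])
);
console.log(results);

// These patterns check shape only: an impossible date still passes.
const shapeOnly = patterns.date.test("2024-99-99");
console.log(shapeOnly); // true
```

Only the email pattern is anchored with `^` and `$`; the others match anywhere inside a longer string, which suits extraction but needs anchors added for strict validation.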

Question 4

What are capture groups in regex?

Accepted Answer

Capture groups, written with parentheses `(pattern)`, let you extract specific parts of a match. For example, `/([\w.-]+)@([\w.-]+)/` applied to 'user@example.com' gives group 1 = 'user' and group 2 = 'example.com'. Non-capturing groups use `(?:pattern)` — they group without capturing.
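The example above could look like this in JavaScript. This is a sketch using `String.prototype.match`; the variable names are chosen for illustration.

```javascript
const address = "user@example.com";

// Two capture groups: index 0 is the whole match, 1 and 2 are the groups.
const match = address.match(/([\w.-]+)@([\w.-]+)/);
const [whole, local, domain] = match;
console.log(whole, local, domain); // user@example.com user example.com

// With a non-capturing group the first part is still matched,
// but it is not stored, so the domain becomes group 1.
const domainOnly = address.match(/(?:[\w.-]+)@([\w.-]+)/);
console.log(domainOnly[1]); // example.com
```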

Question 5

When should I use regex vs string methods?

Accepted Answer

Use string methods (`includes`, `startsWith`, `split`, `replace`) for simple operations — they're faster and more readable. Switch to regex when you need pattern matching (find any email), multiple replacements with the `g` flag, or validation (ensure a string has exactly the right format).
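To make the distinction concrete, here is a small JavaScript sketch. The sample text is invented, and the email pattern is a simplified one used only for illustration.

```javascript
const line = "Contact: a@x.org, b@y.org; phone 555-1234";

// Fixed text: plain string methods are enough and read clearly.
const isContactLine = line.startsWith("Contact:");
const mentionsPhone = line.includes("phone");
const fields = "a,b,c".split(",");

// A pattern ("any email address"): this is where regex earns its place.
const emails = line.match(/[\w.-]+@[\w.-]+\.\w{2,}/g);

// Replacing with a string argument changes only the first occurrence;
// a regex with the `g` flag changes every occurrence.
const firstOnly = "2024-01-05".replace("-", "/");
const allDashes = "2024-01-05".replace(/-/g, "/");

console.log(isContactLine, mentionsPhone, fields, emails, firstOnly, allDashes);
```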

Question 6

Why does regex have so many syntax variations?

Accepted Answer

Different tools and languages implemented regex independently from the 1970s through the 1990s, each with its own conventions. The main flavors are POSIX BRE (basic), POSIX ERE (extended), PCRE (Perl-Compatible Regular Expressions), JavaScript, Python's re, .NET, and Java, and each differs slightly. Most modern languages (JavaScript, Python, Java, .NET, PHP, Ruby) use Perl-style syntax that is close to PCRE; Go uses RE2 syntax, which is similar but omits features such as backreferences. Older Unix tools such as grep and sed default to POSIX BRE, and Vim has its own flavor. The differences are usually minor (which characters need escaping, named-group syntax, some less common features). For most everyday use, the basic syntax works across all flavors; advanced features may require flavor-specific code.

Question 7

Can I use regex to validate email addresses?

Accepted Answer

Yes, but imperfectly. The full email format spec (RFC 5322) is complex, and a 'perfect' validation regex is enormous. Practical patterns like `^[\w.+-]+@[\w-]+\.[\w.-]+$` accept the vast majority of real-world addresses and reject most invalid ones, though they still let some malformed input through (for example, consecutive dots in the domain). The most reliable approach: use a simple regex to reject obvious garbage, then send a confirmation email, because the only way to know an address is valid is whether it can receive mail. Never rely on regex alone for high-stakes email validation.
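A minimal sketch of the first step (the cheap shape check) in JavaScript is shown below. The function name `looksLikeEmail` and the sample addresses are hypothetical; the confirmation-email step depends on your mail setup and is not shown.

```javascript
// The practical pattern from the answer: a shape check, not proof of validity.
const EMAIL_SHAPE = /^[\w.+-]+@[\w-]+\.[\w.-]+$/;

// Hypothetical helper: returns true when the input has a plausible email shape.
function looksLikeEmail(input) {
  return EMAIL_SHAPE.test(input.trim());
}

console.log(looksLikeEmail("jane+news@example.co.uk")); // true
console.log(looksLikeEmail("not-an-email"));            // false
console.log(looksLikeEmail("a@b"));                     // false

// The check is deliberately loose, so some malformed input still passes.
// This is why a confirmation email is the real test.
console.log(looksLikeEmail("user@example..com"));       // true
```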

Question 8

What's the difference between greedy and lazy regex?

Accepted Answer

Greedy quantifiers (`*`, `+`) match as much as possible: `.*` against "a [first] and b [second]" matches the entire string. Lazy quantifiers (`*?`, `+?`) match as little as possible: `.*?` on its own matches the empty string, and it expands only as far as the rest of the pattern requires. Example: extracting text between brackets. Greedy: `\[.*\]` matches "[first] and b [second]" (everything from the first [ to the last ]). Lazy: `\[.*?\]` matches "[first]" and then "[second]" separately. For matching content between delimiters, lazy is almost always what you want.
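The bracket example can be reproduced in JavaScript as follows. This is a sketch; the brackets are escaped as `\[` and `\]` because unescaped brackets would start a character class.

```javascript
const text = "a [first] and b [second]";

// Greedy: runs from the first "[" to the last "]".
const greedy = text.match(/\[.*\]/)[0];

// Lazy: stops at the nearest "]", so each bracketed part is a separate match.
const lazy = text.match(/\[.*?\]/g);

// A capture group around the lazy part extracts just the inner text.
const inner = [...text.matchAll(/\[(.*?)\]/g)].map((m) => m[1]);

console.log(greedy); // [first] and b [second]
console.log(lazy);   // [ '[first]', '[second]' ]
console.log(inner);  // [ 'first', 'second' ]
```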

Question 9

Are regular expressions case-sensitive?

Accepted Answer

By default, yes. The i flag enables case-insensitive matching. /Hello/ matches "Hello" but not "hello" or "HELLO". /Hello/i matches all three. Most languages and tools support the i flag. Some use different syntax: Python uses re.IGNORECASE, .NET uses RegexOptions.IgnoreCase, but the underlying behavior is the same. For matching usernames, hashtags, file extensions — anywhere case shouldn't matter — always use the case-insensitive flag.
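In JavaScript the behavior described above looks like this. The `isImage` helper and its list of extensions are illustrative assumptions, not part of any library.

```javascript
const samples = ["Hello", "hello", "HELLO"];

// Without a flag, matching is case-sensitive.
const sensitive = samples.map((s) => /Hello/.test(s));

// The `i` flag makes the same pattern case-insensitive.
const insensitive = samples.map((s) => /Hello/i.test(s));

console.log(sensitive);   // [ true, false, false ]
console.log(insensitive); // [ true, true, true ]

// Hypothetical helper: file extensions are a typical case where
// letter case should not matter.
const isImage = (name) => /\.(png|jpe?g|gif)$/i.test(name);
console.log(isImage("PHOTO.JPG"), isImage("notes.txt")); // true false
```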

Question 10

What is catastrophic backtracking?

Accepted Answer

A regex performance failure where certain patterns take exponential time on certain inputs. Classic example: (a+)+b against "aaaaaaaaaaaaaaaaaaaaa" (no trailing b). The regex engine tries an enormous number of combinations before giving up. Such patterns can crash applications by consuming all CPU. Common causes: nested quantifiers ((x+)+, (x*)*), alternative branches that overlap ((a|aa)*). For production use: test regex patterns against large inputs; consider RE2-compatible patterns (Go's regex doesn't support backreferences but is guaranteed linear time); fail fast with timeouts on user-input regex. RegexBuddy, RegEx101, and the Mubboo regex tester help identify performance problems before deployment.
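The sketch below times the classic pattern against a simpler pattern that matches the same strings, using deliberately short inputs so it finishes quickly. The `timeMatch` helper is hypothetical, and the actual timings depend on the engine and machine, so none are quoted here. Assume a runtime where `performance.now()` is available globally (browsers and recent Node.js versions).

```javascript
// Hypothetical helper: run one match and report the result and elapsed time.
function timeMatch(regex, input) {
  const start = performance.now();
  const matched = regex.test(input);
  return { matched, ms: performance.now() - start };
}

const risky = /(a+)+b/; // nested quantifiers: many ways to split a run of "a"s
const safer = /a+b/;    // matches the same strings without the nesting

// Inputs with no trailing "b" force a backtracking engine to try every split.
// Keep n small: the work for the risky pattern grows quickly with each extra "a".
const timings = [10, 14, 18].map((n) => {
  const input = "a".repeat(n);
  return { n, risky: timeMatch(risky, input), safer: timeMatch(safer, input) };
});

for (const t of timings) {
  console.log(`n=${t.n} risky=${t.risky.ms.toFixed(2)}ms safer=${t.safer.ms.toFixed(2)}ms`);
}
```

Rewriting the pattern so that there is only one way to match a given run of characters is usually the first fix to try; timeouts and linear-time engines are the fallback when the pattern comes from user input.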

Free Regex Tester — Test Regular Expressions in Real Time
