What is the difference between a greedy and a lazy quantifier?

Greedy quantifiers (*, +, ?) match as much as possible. With the pattern applied to " bold ", a greedy .* matches everything from the first , consuming the entire string. Lazy quantifiers (*?, +?, ??) match as little as possible. The pattern on the same input matches each tag individually: , then . Add a ? after any quantifier to make it lazy.

Why doesn't my regex match across multiple lines?

By default, the dot (.) does not match newline characters, and ^ / $ match the start and end of the entire string. This means a pattern that assumes a single line will not match a multi-line input. To fix this: use the s (dot-all) flag to make . match newlines, and the m (multiline) flag to make ^ / $ match the start and end of each line. In JavaScript: /pattern/sm.

How do I match a literal dot or special character?

Escape it with a backslash: \. matches a literal dot, \* matches a literal asterisk. The special characters that need escaping are: . * + ? ^ $ { } [ ] ( ) | \ Inside a character class [...], most special characters lose their special meaning: [.] matches a literal dot. The exceptions inside a class are ], \, ^, and -.

What are the performance implications of lookaheads?

Lookaheads and lookbehinds are zero-width — they do not consume characters. For most patterns, the performance impact is negligible. Performance problems arise from catastrophic backtracking, not lookaheads per se. Patterns like (a+)+ applied to a long non-matching string cause exponential backtracking. Use atomic groups or possessive quantifiers (not available in JavaScript) to prevent this, or restructure the pattern to avoid nested quantifiers.

Regex Cheatsheet

A quick reference for regular expression syntax: character classes, quantifiers, anchors, lookarounds, flags, and common patterns.

4 min read·Updated June 2026

A quick-reference for regular expression syntax. For a conceptual introduction, see the What is a Regular Expression? guide.

Character classes

Token	Matches
`.`	Any character except newline (use /s flag to include newline)
`\d`	Digit — [0-9]
`\D`	Non-digit — [^0-9]
`\w`	Word character — [a-zA-Z0-9_]
`\W`	Non-word character
`\s`	Whitespace — space, tab, newline, carriage return
`\S`	Non-whitespace
`[abc]`	Character class — a, b, or c
`[^abc]`	Negated class — any character except a, b, or c
`[a-z]`	Range — any lowercase letter
`[a-zA-Z0-9]`	Alphanumeric

Quantifiers

Token	Meaning	Note
`*`	Zero or more	Greedy
`+`	One or more	Greedy
`?`	Zero or one	Greedy
`{n}`	Exactly n times
`{n,}`	At least n times	Greedy
`{n,m}`	Between n and m times	Greedy
`*?`	Zero or more	Lazy
`+?`	One or more	Lazy

Add ? after any quantifier to make it lazy (match as few as possible): .*? .+? {n,m}?

Anchors & boundaries

Token	Matches at
`^`	Start of string (or start of line with /m flag)
`$`	End of string (or end of line with /m flag)
`\b`	Word boundary — between \w and \W
`\B`	Non-word boundary
`\A`	Start of string (Python/Java — no /m effect)
`\Z`	End of string (Python/Java — no /m effect)

Groups & alternation

Token	Meaning
`(abc)`	Capturing group — captures the match as group 1, 2, …
`(?:abc)`	Non-capturing group — groups without capturing
`(?<name>abc)`	Named capturing group — accessible as match.groups.name
`\1`	Backreference to group 1
`\k<name>`	Backreference to named group
`a\|b`	Alternation — matches a or b

Lookaheads & lookbehinds

Token	Meaning	Example
`(?=...)`	Positive lookahead	`foo(?=bar) — "foo" followed by "bar"`
`(?!...)`	Negative lookahead	`foo(?!bar) — "foo" NOT followed by "bar"`
`(?<=...)`	Positive lookbehind	`(?<=\$)\d+ — digits preceded by "$"`
`(?<!...)`	Negative lookbehind	`(?<!un)happy — "happy" NOT preceded by "un"`

Lookarounds are zero-width — they assert a condition without consuming characters.

Flags

Flag	Name	Effect
`g`	Global	Find all matches, not just the first
`i`	Case-insensitive	[a-z] also matches [A-Z]
`m`	Multiline	^ and $ match start/end of each line
`s`	Dot-all	. also matches newline characters
`u`	Unicode	Enables full Unicode matching and \u{XXXX} escapes
`y`	Sticky	Matches only at lastIndex position (JS)

Common patterns

Pattern	Regex	Notes
Email (simple)	`[a-zA-Z0-9._%+\-]+@[a-zA-Z0-9.\-]+\.[a-zA-Z]{2,}`	Not RFC 5322 compliant — use a library for strict validation
URL (http/https)	`https?:\/\/[\w\-]+(\.[\w\-]+)+([\w\-\._~:/?#[\]@!$&'()+,;=%])?`	Simple — does not handle all edge cases
IPv4 address	`(?:\d{1,3}\.){3}\d{1,3}`	Does not validate range (0-255)
IPv4 address (strict)	`(?:(?:25[0-5]\|2[0-4]\d\|[01]?\d\d?)\.){3}(?:25[0-5]\|2[0-4]\d\|[01]?\d\d?)`	Validates 0-255 range
Phone (intl)	`\+?[\d\s\-().]{7,20}`	Flexible — adapt to your requirements
Date (YYYY-MM-DD)	`\d{4}-(?:0[1-9]\|1[0-2])-(?:0[1-9]\|[12]\d\|3[01])`	Does not check calendar validity
Hex color	`#(?:[0-9a-fA-F]{3}){1,2}`	Matches #rgb and #rrggbb
Slug (URL-safe)	`[a-z0-9]+(?:-[a-z0-9]+)*`	Lowercase alphanumeric + hyphens
Semver	`\d+\.\d+\.\d+(?:-[\w.]+)?(?:\+[\w.]+)?`	Simplified — see official semver regex for full spec
UUID v4	`[0-9a-f]{8}-[0-9a-f]{4}-4[0-9a-f]{3}-[89ab][0-9a-f]{3}-[0-9a-f]{12}`	Case-insensitive — add i flag
Credit card (Luhn)	`(?:4\d{12}(?:\d{3})?\|5[1-5]\d{14}\|3[47]\d{13})`	Structural only — use Luhn algorithm for real validation

Language differences

Feature	JavaScript	Python	Go	Java
Syntax	`/pattern/flags`	`re.compile()`	`regexp.MustCompile()`	`Pattern.compile()`
Lookahead	`✓`	`✓`	`✗`	`✓`
Lookbehind	`✓`	`✓`	`✗`	`✓`
Named groups	`(?<name>)`	`(?P<name>)`	`(?P<name>)`	`(?<name>)`
Non-greedy	`.*?`	`.*?`	`.*?`	`.*?`
Unicode class	`\p{L} (with v flag)`	`\p{L}`	`\p{L}`	`\p{L}`