REGEX Function (LibreOffice Calc)

Text Advanced LibreOffice Calc Introduced in LibreOffice 6.2

text regex pattern-matching extraction replacement validation parsing

The REGEX function in LibreOffice Calc performs powerful regular-expression-based text extraction, replacement, and validation. It is one of the most flexible and capable text-processing tools available.

Compatibility

▾

Excel	✔
Gnumeric	✔
Google_sheets	✔
Libreoffice	✔
Numbers	✖
Onlyoffice	✔
Openoffice	✖
Wps	✔
Zoho	✔

What the REGEX Function Does ▾

Extracts text matching a pattern
Replaces text matching a pattern
Validates text against a pattern
Supports capturing groups
Supports backreferences in replacements
Works with text, numbers (converted to text), and formulas

It is designed to be powerful, expressive, and ideal for advanced text manipulation.

Syntax ▾

REGEX(text; pattern; replacement; [flags])

Arguments

text:
The text to process.
pattern:
A regular expression defining what to match.
replacement:
The replacement text.
- Use empty string "" for extraction mode
- Use $1, $2, etc. for captured groups
flags: (optional)
- “g” → global (replace all matches)
- “i” → case-insensitive
- “m” → multi-line
- Combine flags as needed (e.g., “gi”)

Modes of Operation ▾

1. Extraction Mode

Use replacement = "" to return the matched text.

2. Replacement Mode

Provide replacement text to transform the input.

3. Validation Mode

Use patterns like ^...$ to test full-string matches.

Basic Examples ▾

Extract digits

=REGEX("A123B"; "\d+"; "")

Returns "123".

Extract first word

=REGEX(A1; "^\w+"; "")

Replace digits with X

=REGEX("A123B"; "\d+"; "X")

Returns "AXB".

Remove all non-digits

=REGEX(A1; "\D+"; ""; "g")

Case-insensitive replacement

=REGEX("Hello WORLD"; "world"; "Earth"; "i")

Advanced Examples ▾

Extract email username

=REGEX(A1; "^[^@]+"; "")

Extract email domain

=REGEX(A1; "@(.+)$"; "$1")

Extract file extension

=REGEX(A1; "\.([^.]+)$"; "$1")

Extract everything before extension

=REGEX(A1; "\.[^.]+$"; "")

Extract numbers from mixed text

=REGEX(A1; "\d+"; "")

Extract the nth number using capturing groups

=REGEX(A1; "(?:\D*\d+){" & B1 & "}"; "")

Normalize whitespace

=REGEX(A1; "\s+"; " "; "g")

Remove HTML tags

=REGEX(A1; "<[^>]+>"; ""; "g")

Extract quoted text

=REGEX(A1; "\"([^\"]*)\""; "$1")

Extract IPv4 octets

=REGEX(A1; "(\d+)\.(\d+)\.(\d+)\.(\d+)"; "$1|$2|$3|$4")

Validate a phone number

=REGEX(A1; "^\d{3}-\d{3}-\d{4}$"; "valid"; "i")

Validate an email

=REGEX(A1; "^[^@]+@[^@]+\.[^@]+$"; "valid")

Extract all uppercase sequences

=REGEX(A1; "[A-Z]+"; "")

Extract the last number

=REGEX(A1; ".*?(\d+)$"; "$1")

Edge Cases and Behavior Details ▾

Empty replacement → extraction mode

No match → returns original text

Unless pattern anchored with ^…$.

Capturing groups use $1, $2, etc.

Global flag “g” required for multiple replacements

Multi-line mode “m” affects ^ and $

text is a number → converted to text

pattern errors → Err:509

REGEX of an error propagates the error

Unicode support varies by engine

Greedy vs. lazy quantifiers matter

.* → greedy
.*? → lazy

Common Errors and Fixes ▾

Err:509 — Invalid pattern

Cause:

Unescaped characters
Missing parentheses
Invalid quantifiers

Unexpected matches

Cause:

Greedy quantifiers
Missing anchors
Case sensitivity mismatch

Fix:

Use ? for lazy matching
Add ^ and $
Use “i” flag

Replacement not applied everywhere

Cause:

Missing “g” flag

Hidden characters break patterns

Fix:

Use CLEAN or SUBSTITUTE

Best Practices ▾

Use REGEX for complex parsing and cleanup
Use capturing groups for structured extraction
Use lazy quantifiers to avoid overmatching
Use anchors for validation
Use TRIM and CLEAN before applying REGEX
Combine REGEX with TEXTBEFORE/TEXTAFTER for hybrid parsing
Use global flag “g” for multi-match replacement

REGEX is your precision scalpel — perfect for advanced extraction, cleanup, validation, and transformation workflows that no other Calc function can match.

Related Patterns and Alternatives ▾

Use TEXTBEFORE and TEXTAFTER for simple delimiter parsing
Use SEARCH and FIND for positional extraction
Use LEFT, RIGHT, MID for manual slicing
Use SUBSTITUTE for simple replacements
Use SPLIT for array-based decomposition

By mastering REGEX and its companion functions, you can build extremely powerful, flexible, and expressive text‑processing workflows in LibreOffice Calc.

Compatibility

What the REGEX Function Does ▾

Syntax ▾

Arguments

Modes of Operation ▾

1. Extraction Mode

2. Replacement Mode

3. Validation Mode

Basic Examples ▾

Extract digits

Extract first word

Replace digits with X

Remove all non-digits

Case-insensitive replacement

Advanced Examples ▾

Extract email username

Extract email domain

Extract file extension

Extract everything before extension

Extract numbers from mixed text

Extract the nth number using capturing groups

Normalize whitespace

Remove HTML tags

Extract quoted text

Extract IPv4 octets

Validate a phone number

Validate an email

Extract all uppercase sequences

Extract the last number

Edge Cases and Behavior Details ▾

Empty replacement → extraction mode

No match → returns original text

Capturing groups use $1, $2, etc.

Global flag “g” required for multiple replacements

Multi-line mode “m” affects ^ and $

text is a number → converted to text

pattern errors → Err:509

REGEX of an error propagates the error

Unicode support varies by engine

Greedy vs. lazy quantifiers matter

Common Errors and Fixes ▾

Err:509 — Invalid pattern

Unexpected matches

Replacement not applied everywhere

Hidden characters break patterns

Best Practices ▾

Related Patterns and Alternatives ▾

Related Functions