Regex Match String

This chapter introduces regular expression support for Oracle Database. This chapter covers the following topics:

If you need to know if a string matches a regular expression RegExp, use RegExp.test. If you only want the first match found, you might want to use RegExp.exec instead. If you want to obtain capture groups and the global flag is set, you need to use RegExp.exec instead. Like strings, regexps use the backslash, to escape special behaviour. So to match an., you need the regexp. Unfortunately this creates a problem. We use strings to represent regular expressions, and is also used as an escape symbol in strings. So to create the regular expression. We need the string '.'

What Are Regular Expressions?

Regular expressions enable you to search for patterns in string data by using standardized syntax conventions. You specify a regular expression by means of the following types of characters:

Metacharacters, which are operators that specify search algorithms
Literals, which are the characters for which you are searching

A regular expression can specify complex patterns of character sequences. For example, the following regular expression searches for the literals f or ht, the t literal, the p literal optionally followed by the s literal, and finally the colon (:) literal:

The parentheses are metacharacters that group a series of pattern elements to a single element; the pipe symbol (|) matches one of the alternatives in the group. The question mark (?) is a metacharacter indicating that the preceding pattern, in this case the s character, is optional. Thus, the preceding regular expression matches the http:, https:, ftp:, and ftps: strings.

How Are Oracle Database Regular Expressions Useful?

Regular expressions are a powerful text processing component of programming languages such as Perl and Java. For example, a Perl script can process each HTML file in a directory, read its contents into a scalar variable as a single string, and then use regular expressions to search for URLs in the string. One reason that many developers write in Perl is for its robust pattern matching functionality.

Oracle's support of regular expressions enables developers to implement complex match logic in the database. This technique is useful for the following reasons:

By centralizing match logic in Oracle Database, you avoid intensive string processing of SQL results sets by middle-tier applications. For example, life science customers often rely on Perl to do pattern analysis on bioinformatics data stored in huge databases of DNAs and proteins. Previously, finding a match for a protein sequence such as [AG].{4}GK[ST] would be handled in the middle tier. The SQL regular expression functions move the processing logic closer to the data, thereby providing a more efficient solution.
Prior to Oracle Database 10g, developers often coded data validation logic on the client, requiring the same validation logic to be duplicated for multiple clients. Using server-side regular expressions to enforce constraints solves this problem.
The built-in SQL and PL/SQL regular expression functions and conditions make string manipulations more powerful and less cumbersome than in previous releases of Oracle Database.

Oracle Database Implementation of Regular Expressions

Oracle Database implements regular expression support with a set of Oracle Database SQL functions and conditions that enable you to search and manipulate string data. You can use these functions in any environment that supports Oracle Database SQL. You can use these functions on a text literal, bind variable, or any column that holds character data such as CHAR, NCHAR, CLOB, NCLOB, NVARCHAR2, and VARCHAR2 (but not LONG).

Table 4-1 gives a brief description of the regular expression functions and conditions.

Table 4-1 SQL Regular Expression Functions and Conditions

SQL Element	Category	Description
Searches a character column for a pattern. Use this function in the `WHERE` clause of a query to return rows matching a regular expression. The condition is also valid in a constraint or as a PL/SQL function returning a boolean. The following `WHERE` clause filters employees with a first name of Steven or Stephen:
Searches for a pattern in a character column and replaces each occurrence of that pattern with the specified string. The following function puts a space after each character in the `country_name` column:
Searches a string for a given occurrence of a regular expression pattern and returns an integer indicating the position in the string where the match is found. You specify which occurrence you want to find and the start position. For example, the following performs a boolean test for a valid email address in the `email` column:
Returns the substring matching the regular expression pattern that you specify. The following function uses the `x` flag to match the first string by ignoring spaces in the regular expression:

A string literal in a REGEXP function or condition conforms to the rules of SQL text literals. By default, regular expressions must be enclosed in single quotes. If your regular expression includes the single quote character, then enter two single quotation marks to represent one single quotation mark within the expression. This technique ensures that the entire expression is interpreted by the SQL function and improves the readability of your code. You can also use the q-quote syntax to define your own character to terminate a text literal. For example, you could delimit your regular expression with the pound sign (#) and then use a single quote within the expression.

Note:

If your expression comes from a column or a bind variable, then the same rules for quoting do not apply.

Oracle Database Support for the POSIX Regular Expression Standard

Oracle's implementation of regular expressions conforms to the following standards:

IEEE Portable Operating System Interface (POSIX) standard draft 1003.2/D11.2
Unicode Regular Expression Guidelines of the Unicode Consortium

Oracle Database follows the exact syntax and matching semantics for these operators as defined in the POSIX standard for matching ASCII (English language) data. You can find the POSIX standard draft at the following URL:

Oracle Database enhances regular expression support in the following ways:

Extends the matching capabilities for multilingual data beyond what is specified in the POSIX standard.
Adds support for the common Perl regular expression extensions that are not included in the POSIX standard but do not conflict with it. Oracle Database provides built-in support for some of the most heavily used Perl regular expression operators, for example, character class shortcuts, the non-greedy modifier, and so on.

Oracle Database supports a set of common metacharacters used in regular expressions. The behavior of supported metacharacters and related features is described in 'Regular Expression Metacharacters in Oracle Database'.

Note:

The interpretation of metacharacters differs between tools that support regular expressions. If you are porting regular expressions from another environment to Oracle Database, ensure that the regular expression syntax is supported and the behavior is what you expect.

Regular Expression Metacharacters in Oracle Database

This section contains the following topics:

POSIX Metacharacters in Oracle Database Regular Expressions

Table 4-2 lists the list of metacharacters supported for use in regular expressions passed to SQL regular expression functions and conditions. These metacharacters conform to the POSIX standard; any differences in behavior from the standard are noted in the 'Description' column.

Table 4-2 POSIX Metacharacters in Oracle Database Regular Expressions

Syntax	Operator Name	Description	Example
`.`	Any Character — Dot	Matches any character in the database character set. If the `n` flag is set, it matches the newline character. The newline is recognized as the linefeed character (`x0a`) on UNIX and Windows or the carriage return character (`x0d`) on Macintosh platforms. Note: In the POSIX standard, this operator matches any English character except NULL and the newline character.	The expression `a.b` matches the strings `abb`, `acb`, and `adb`, but does not match `acc`.
`+`	One or More — Plus Quantifier	Matches one or more occurrences of the preceding subexpression.	The expression `a+` matches the strings `a`, `aa`, and aaa, but does not match `bbb`.
`?`	Zero or One — Question Mark Quantifier	Matches zero or one occurrence of the preceding subexpression.	The expression `ab?c` matches the strings `abc` and `ac`, but does not match `abbc`.
`*`	Zero or More — Star Quantifier	Matches zero or more occurrences of the preceding subexpression. By default, a quantifier match is greedy because it matches as many times as possible while still allowing the rest of the match to succeed.	The expression `ab*c` matches the strings `ac`, `abc`, and `abbc`, but does not match `abb`.
`{m}`	Interval—Exact Count	Matches exactly `m` occurrences of the preceding subexpression.	The expression `a{3}` matches the strings `aaa`, but does not match `aa`.
`{m,}`	Interval—At Least Count	Matches at least `m` occurrences of the preceding subexpression.	The expression `a{3,}` matches the strings `aaa` and `aaaa`, but does not match `aa`.
`{m,n}`	Interval—Between Count	Matches at least `m`, but not more than `n` occurrences of the preceding subexpression.	The expression `a{3,5}` matches the strings `aaa`, `aaaa`, and `aaaaa`, but does not match `aa`.
`[ .. ]`	Matching Character List	Matches any single character in the list within the brackets. The following operators are allowed within the list, but other metacharacters included are treated as literals: Range operator: `-` POSIX character class: `[: :]` POSIX collation element: `[. .]` POSIX character equivalence class: `[= =]` A dash (`-`) is a literal when it occurs first or last in the list, or as an ending range point in a range expression, as in `[#--]`. A right bracket (`]`) is treated as a literal if it occurs first in the list. Note: In the POSIX standard, a range includes all collation elements between the start and end of the range in the linguistic definition of the current locale. Thus, ranges are linguistic rather than byte values ranges; the semantics of the range expression are independent of character set. In Oracle Database, the linguistic range is determined by the `NLS_SORT` initialization parameter.	The expression `[abc]` matches the first character in the strings `all`, `bill`, and `cold`, but does not match any characters in `doll`.
`[^ .. ]`	Non-Matching Character List	Matches any single character not in the list within the brackets. Characters not in the non-matching character list are returned as a match. Refer to the description of the Matching Character List operator for an account of metacharacters allowed in the character list.	The expression `[^abc]` matches the character `d` in the string `abcdef`, but not the character `a`, `b`, or `c`. The expression `[^abc]+` matches the sequence `def` in the string `abcdef`, but not `a`, `b`, or `c`. The expression `[^a-i]` excludes any character between `a` and `i` from the search result. This expression matches the character `j` in the string `hij`, but does not match any characters in the string `abcdefghi`.
`\|`	Or	Matches one of the alternatives.	The expression `a\|b` matches character `a` or character `b`.
`( .. )`	Subexpression or Grouping	Treats the expression within parentheses as a unit. The subexpression can be a string of literals or a complex expression containing operators.	The expression `(abc)?def` matches the optional string `abc`, followed by `def`. Thus, the expression matches `abcdefghi` and `def`, but does not match `ghi`.
`n`	Backreference	Matches the n^th preceding subexpression, that is, whatever is grouped within parentheses, where `n` is an integer from 1 to 9. The parentheses cause an expression to be remembered; a backreference refers to it. A backreference counts subexpressions from left to right, starting with the opening parenthesis of each preceding subexpression. The expression is invalid if the source string contains fewer than n subexpressions preceding the `n`. Oracle supports the backreference expression in the regular expression pattern and the replacement string of the `REGEXP_REPLACE` function.	The expression `(abc\|def)xy1` matches the strings `abcxyabc` and `defxydef`, but does not match `abcxydef` or `abcxy`. A backreference enables you to search for a repeated string without knowing the actual string ahead of time. For example, the expression `^(.*)1$` matches a line consisting of two adjacent instances of the same string.
	Escape Character	Treats the subsequent metacharacter in the expression as a literal. Use a backslash () to search for a character that is normally treated as a metacharacter. Use consecutive backslashes () to match the backslash literal itself.	The expression `+` searches for the plus character (`+`). It matches the plus character in the string `abc+def`, but does not match `abcdef`.
`^`	Beginning of Line Anchor	Matches the beginning of a string (default). In multiline mode, it matches the beginning of any line within the source string.	The expression `^def` matches `def` in the string `defghi` but does not match `def` in `abcdef`.
`$`	End of Line Anchor	Matches the end of a string (default). In multiline mode, it matches the beginning of any line within the source string.	The expression `def$` matches `def` in the string `abcdef` but does not match `def` in the string `defghi`.
`[:class:]`	POSIX Character Class	Matches any character belonging to the specified POSIX character `class`. You can use this operator to search for characters with specific formatting such as uppercase characters, or you can search for special characters such as digits or punctuation characters. The full set of POSIX character classes is supported. Note: In English regular expressions, range expressions often indicate a character class. For example, `[a-z]` indicates any lowercase character. This convention is not useful in multilingual environments, where the first and last character of a given character class may not be the same in all languages. Oracle supports the character classes in Table 4-3 based on character class definitions in Globalization classification data.	The expression `[[:upper:]]+` searches for one or more consecutive uppercase characters. This expression matches `DEF` in the string `abcDEFghi` but does not match the string `abcdefghi`.
`[.element.]`	POSIX Collating Element Operator	Specifies a collating element to use in the regular expression. The `element` must be a defined collating element in the current locale. Use any collating element defined in the locale, including single-character and multicharacter elements. The `NLS_SORT` initialization parameter determines supported collation elements.This operator lets you use a multicharacter collating element in cases where only one character would otherwise be allowed. For example, you can ensure that the collating element `ch`, when defined in a locale such as Traditional Spanish, is treated as one character in operations that depend on the ordering of characters.	The expression `[[.ch.]]` searches for the collating element `ch` and matches `ch` in string `chabc`, but does not match `cdefg`. The expression `[a-[.ch.]]` specifies the range `a` to `ch`.
`[=character=]`	POSIX Character Equivalence Class	Matches all characters that are members of the same character equivalence class in the current locale as the specified `character`. The character equivalence class must occur within a character list, so the character equivalence class is always nested within the brackets for the character list in the regular expression. Usage of character equivalents depends on how canonical rules are defined for your database locale. Refer to theOracle Database Globalization Support Guide for more information on linguistic sorting and string searching.	The expression `[[=n=]]` searches for characters equivalent to `n` in a Spanish locale. It matches both `N` and `ñ` in the string `El Niño`.

Regular Expression Operator Multilingual Enhancements

When applied to multilingual data, Oracle's implementation of the POSIX operators extends beyond the matching capabilities specified in the POSIX standard. Table 4-3 shows the relationship of the operators in the context of the POSIX standard.

The first column lists the supported operators.
The second column indicates whether the POSIX standard for Basic Regular Expression (BRE) defines the operator.
The third column indicates whether the POSIX standard for Extended Regular Expression (ERE) defines the operator.
The fourth column indicates whether the Oracle Database implementation extends the operator's semantics for handling multilingual data.

Oracle Database lets you enter multibyte characters directly, if you have a direct input method, or use functions to compose the multibyte characters. You cannot use the Unicode hexadecimal encoding value of the form xxxx. Oracle evaluates the characters based on the byte values used to encode the character, not the graphical representation of the character.

Table 4-3 POSIX and Multilingual Operator Relationships

Operator	POSIX BRE syntax	POSIX ERE Syntax	Multilingual Enhancement
Nero 2016 serial numaras? guncel.	Yes	Yes	--
`*`	Yes	Yes	--
`+`	--	Yes	--
`?`	--	Yes	--
`\|`	--	Yes	--
`^`	Yes	Yes	Yes
`$`	Yes	Yes	Yes
`.`	Yes	Yes	Yes
`[ ]`	Yes	Yes	Yes
`( )`	Yes	Yes	--
`{m}`	Yes	Yes	--
`{m,}`	Yes	Yes	--
`{m,n}`	Yes	Yes	--
`n`	Yes	Yes	Yes
`[.]`	Yes	Yes	Yes
`[::]`	Yes	Yes	Yes
`[]`	Yes	Yes	Yes

Perl-Influenced Extensions in Oracle Regular Expressions

Table 4-4 describes Perl-influenced metacharacters supported in Oracle Database regular expression functions and conditions. These metacharacters are not in the POSIX standard, but are common at least partly due to the popularity of Perl. Note that Perl character class matching is based on the locale model of the operating system, whereas Oracle Database regular expressions are based on the language-specific data of the database. In general, a regular expression involving locale data cannot be expected to produce the same results between Perl and Oracle Database.

Table 4-4 Perl-Influenced Extensions in Oracle Regular Expressions

Reg. Exp.	Matches . . .	Example
`d`	A digit character. It is equivalent to the POSIX class `[[:digit:]]`.	The expression `^(d{3}) d{3}-d{4}$` matches `(650) 555-1212` but does not match `650-555-1212`.
`D`	A non-digit character. It is equivalent to the POSIX class `[^[:digit:]]`.	The expression `wdD` matches `b2b` and `b2_` but does not match `b22`.
`w`	A word character, which is defined as an alphanumeric or underscore (`_`) character. It is equivalent to the POSIX class `[[:alnum:]_]`. Note that if you do not want to include the underscore character, you can use the POSIX class `[[:alnum:]]`.	The expression `w+@w+(.w+)+` matches the string `[email protected]` but not the string `jdoe@company`.
`W`	A non-word character. It is equivalent to the POSIX class `[^[:alnum:]_]`.	The expression `w+Wsw+` matches the string `to: bill` but not the string `to bill`.
`s`	A whitespace character. It is equivalent to the POSIX class `[[:space:]]`.	The expression `(wsws)` matches the string `(a b )` but not the string `(ab)`.
`S`	A non-whitespace character. It is equivalent to the POSIX class `[^[:space:]]`.	The expression `(wSwS)` matches the string `(abde)` but not the string `(a b d e)`.
`A`	Only at the beginning of a string. In multi-line mode, that is, when embedded newline characters in a string are considered the termination of a line, `A` does not match the beginning of each line.	The expression `AL` matches only the first `L` character in the string`Line1nLine2n`, regardless of whether the search is in single-line or multi-line mode.
`Z`	Only at the end of string or before a newline ending a string. In multi-line mode, that is, when embedded newline characters in a string are considered the termination of a line, `Z` does not match the end of each line.	In the expression `sZ,` the `s` matches the last space in the string`L i n e n`, regardless of whether the search is in single-line or multi-line mode.
`z`	Only at the end of a string.	In the expression `sz`, the `s` matches the newline in the string`L i n e n`, regardless of whether the search is in single-line or multi-line mode.
`*?`	The preceding pattern element 0 or more times (non-greedy). Note that this quantifier matches the empty string whenever possible.	The expression `w?xw` is 'non-greedy' and so matches `abxc` in the string `abxcxd`. The expression `wxw` is 'greedy' and so matches `abxcxd in the string` `abxcxd`. The expression `w*?xw` also matches the string `xa`.
`+?`	The preceding pattern element 1 or more times (non-greedy).	The expression `w+?xw` is 'non-greedy' and so matches `abxc` in the string `abxcxd`. The expression `w+xw` is 'greedy' and so matches `abxcxd in the string` `abxcxd`. The expression `w+?xw` does not match the string `xa`, but does match the string `axa`.
`??`	The preceding pattern element 0 or 1 time (non-greedy). Note that this quantifier matches the empty string whenever possible.	The expression `a??aa` is 'non-greedy' and matches `aa` in the string `aaaa`. The expression `a?aa` is 'greedy' and so matches `aaa` in the string `aaaa`.
`{n}?`	The preceding pattern element exactly `n` times (non-greedy). In this case `{n}?` is equivalent to `{n}`.	The expression `(a\|aa){2}?` matches `aa` in the string `aaaa`.
`{n,}?`	The preceding pattern element at least `n` times (non-greedy).	The expression `a{2,}?` is 'non-greedy' and matches `aa` in the string `aaaaa`. The expression `a{2,}` is 'greedy' and so matches `aaaa`a.
`{n,m}?`	At least `n` but not more than `m` times (non-greedy). Note that `{0,m}?` matches the empty string whenever possible.	The expression `a{2,4}?` is 'non-greedy' and matches `aa` in the string `aaaaa`. The expression `a{2,4}` is 'greedy' and so matches `aaaa`.

The Oracle Database regular expression functions and conditions support the pattern matching modifiers described in Table 4-5.

Table 4-5 Pattern Matching Modifiers

Mod.	Description	Example
`i`	Specifies case-insensitive matching.	The following regular expression returns `AbCd`:
`c`	Specifies case-sensitive matching.	The following regular expression fails to match:
n	Allows the period (.), which by default does not match newlines, to match the newline character.	The following regular expression matches the string, but would not match if the `n` flag were not specified:
m	Performs the search in multi-line mode. The metacharacter `^` and `$` signify the start and end, respectively, of any line anywhere in the source string, rather than only at the start or end of the entire source string.	The following regular expression returns `ac`:
`x`	Ignores whitespace characters in the regular expression. By default, whitespace characters match themselves.	The following regular expression returns `abcd`:

Using Regular Expressions in SQL Statements: Scenarios

This section contains the following scenarios:

Using an Integrity Constraint to Enforce a Phone Number Format

Regular expressions are a useful way to enforce integrity constraints. For example, suppose that you want to ensure that phone numbers are entered into the database in a standard format. Example 4-1 creates a contacts table and adds a check constraint to the p_number column to enforce the following format mask:

Example 4-1 Enforcing a Phone Number Format with Regular Expressions

Table 4-6 explains the elements of the regular expression.

Table 4-6 Explanation of the Regular Expression Elements in Example 4-1

Regular Expression Element	Matches . . .
`^`	The beginning of the string.
`(`	A left parenthesis. The backward slash () is an escape character that indicates that the left parenthesis following it is a literal rather than a grouping expression.
`d{3}`	Exactly three digits.
`)`	A right parenthesis. The backward slash () is an escape character that indicates that the right parenthesis following it is a literal rather than a grouping expression.
(space character)	A space character.
`d{3}`	Exactly three digits.
`-`	A hyphen.
`d{4}`	Exactly four digits.
`$`	The end of the string.

Example 4-2 shows a SQL script that attempts to insert seven phone numbers into the contacts table. Only the first two INSERT statements use a format that conforms to the p_number_format constraint; the remaining statements generate check constraint errors.

Using Back References to Reposition Characters

As explained in Table 4-2, back references store matched subexpressions in a temporary buffer, thereby enabling you to reposition characters. You access buffers with the n notation, where n is a number between 1 and 9. Each subexpression is contained in parentheses and is numbered from left to right.

Example 4-3 creates a famous_people table and populates the famous_people.names column with names in different formats.

Example 4-3 Using Back References to Reposition Characters

Example 4-4 shows a query that repositions names in the format 'first middle last' to the format 'last, first middle'. It ignores names not in the format 'first middle last'.

Example 4-4 Using Back References to Reposition Characters

Table 4-7 explains the elements of the regular expression.

Table 4-7 Explanation of the Regular Expression Elements in Example 4-4

Regular Expression Element	Description
`^`	Matches the beginning of the string.
`$`	Matches the end of the string.
`(S+)`	Matches one or more non-space characters. The parentheses are not escaped so they function as a grouping expression.
`s`	Matches a whitespace character.
`1`	Substitutes the first subexpression, that is, the first group of parentheses in the matching pattern.
`2`	Substitutes the second subexpression, that is, the second group of parentheses in the matching pattern.
`3`	Substitutes the third subexpression, that is, the third group of parentheses in the matching pattern.
`,`	Inserts a comma character.

Example 4-5 shows the result set of the query in Example 4-4. The regular expression matched only the first two rows.

Example 4-5 Result Set of Regular Expression Query

-->

Definition

Searches an input string for all occurrences of a regular expression and returns all the matches.

Overloads

Matches(String)Matches(String)Matches(String)Matches(String)	Searches the specified input string for all occurrences of a regular expression.
Matches(String, Int32)Matches(String, Int32)Matches(String, Int32)Matches(String, Int32)	Searches the specified input string for all occurrences of a regular expression, beginning at the specified starting position in the string.
Matches(String, String)Matches(String, String)Matches(String, String)Matches(String, String)	Searches the specified input string for all occurrences of a specified regular expression.
Matches(String, String, RegexOptions)Matches(String, String, RegexOptions)Matches(String, String, RegexOptions)Matches(String, String, RegexOptions)	Searches the specified input string for all occurrences of a specified regular expression, using the specified matching options.
Matches(String, String, RegexOptions, TimeSpan)Matches(String, String, RegexOptions, TimeSpan)Matches(String, String, RegexOptions, TimeSpan)Matches(String, String, RegexOptions, TimeSpan)	Searches the specified input string for all occurrences of a specified regular expression, using the specified matching options and time-out interval.

Matches(String)Matches(String)Matches(String)Matches(String)

Searches the specified input string for all occurrences of a regular expression.

Parameters

input: StringStringStringString

The string to search for a match.

Returns

MatchCollectionMatchCollectionMatchCollectionMatchCollection

A collection of the Match objects found by the search. If no matches are found, the method returns an empty collection object.

Exceptions

ArgumentNullExceptionArgumentNullExceptionArgumentNullExceptionArgumentNullException

input is null.

Examples

The following example uses the Matches(String) method to identify any words in a sentence that end in 'es'.

The regular expression pattern bw+esb is defined as shown in the following table.

Pattern	Description
`b`	Begin the match at a word boundary.
`w+`	Match one or more word characters.
`es`	Match the literal string 'es'.
`b`	End the match at a word boundary.

Remarks

The Matches(String) method is similar to the Match(String) method, except that it returns information about all the matches found in the input string, instead of a single match. It is equivalent to the following code:

The collection includes only matches and terminates at the first non-match.

The regular expression pattern for which the Matches(String) method searches is defined by the call to one of the Regex class constructors. For more information about the elements that can form a regular expression pattern, see Regular Expression Language - Quick Reference.

The Matches method uses lazy evaluation to populate the returned MatchCollection object. Accessing members of this collection such as MatchCollection.Count and MatchCollection.CopyTo causes the collection to be populated immediately. To take advantage of lazy evaluation, you should iterate the collection by using a construct such as foreach in C# and For Each…Next in Visual Basic.

Because of its lazy evaluation, calling the Matches(String) method does not throw a RegexMatchTimeoutException exception. However, the exception is thrown when an operation is performed on the MatchCollection object returned by this method, if the MatchTimeout property is not Regex.InfiniteMatchTimeout and a matching operation exceeds the time-out interval.

See also

Matches(String, Int32)Matches(String, Int32)Matches(String, Int32)Matches(String, Int32)

Searches the specified input string for all occurrences of a regular expression, beginning at the specified starting position in the string.

Parameters

input: StringStringStringString

The string to search for a match.

startat: Int32Int32Int32Int32

The character position in the input string at which to start the search.

Returns

MatchCollectionMatchCollectionMatchCollectionMatchCollection

A collection of the Match objects found by the search. If no matches are found, the method returns an empty collection object.

Exceptions

ArgumentNullExceptionArgumentNullExceptionArgumentNullExceptionArgumentNullException

input is null.

ArgumentOutOfRangeExceptionArgumentOutOfRangeExceptionArgumentOutOfRangeExceptionArgumentOutOfRangeException

startat is less than zero or greater than the length of input.

Examples

The following example uses the Match(String) method to find the first word in a sentence that ends in 'es', and then calls the Matches(String, Int32) method to identify any additional words that end in 'es'.

The regular expression pattern bw+esb is defined as shown in the following table.

Pattern	Description
`b`	Begin the match at a word boundary.
`w+`	Match one or more word characters.
`es`	Match the literal string 'es'.
`b`	End the match at a word boundary.

Remarks

The Matches(String, Int32) method is similar to the Match(String, Int32) method, except that it returns information about all the matches found in the input string, instead of a single match. It is equivalent to the following code:

The regular expression pattern for which the Matches(String, Int32) method searches is defined by the call to one of the Regex class constructors. For more information about the elements that can form a regular expression pattern, see Regular Expression Language - Quick Reference.

Because of its lazy evaluation, calling the Matches(String, Int32) method does not throw a RegexMatchTimeoutException exception. However, the exception is thrown when an operation is performed on the MatchCollection object returned by this method, if the MatchTimeout property is not Regex.InfiniteMatchTimeout and a matching operation exceeds the time-out interval.

See also

Matches(String, String)Matches(String, String)Matches(String, String)Matches(String, String)

Searches the specified input string for all occurrences of a specified regular expression.

Parameters

input: StringStringStringString

The string to search for a match.

pattern: StringStringStringString

The regular expression pattern to match.

Returns

MatchCollectionMatchCollectionMatchCollectionMatchCollection

A collection of the Match objects found by the search. If no matches are found, the method returns an empty collection object.

Exceptions

ArgumentExceptionArgumentExceptionArgumentExceptionArgumentException

A regular expression parsing error occurred.

ArgumentNullExceptionArgumentNullExceptionArgumentNullExceptionArgumentNullException

input or pattern is null.

Examples

The following example uses the Matches(String, String) method to identify any word in a sentence that ends in 'es'.

The regular expression pattern bw+esb is defined as shown in the following table.

Pattern	Description
`b`	Begin the match at a word boundary.
`w+`	Match one or more word characters.
`es`	Match the literal string 'es'.
`b`	End the match at a word boundary.

Remarks

The Matches(String, String) method is similar to the Match(String, String) method, except that it returns information about all the matches found in the input string, instead of a single match. It is equivalent to the following code:

The static Matches methods are equivalent to constructing a Regex object with the specified regular expression pattern and calling the instance method Matches.

The pattern parameter consists of regular expression language elements that symbolically describe the string to match. For more information about regular expressions, see .NET Framework Regular Expressions and Regular Expression Language - Quick Reference.

Because of its lazy evaluation, calling the Matches(String, String) method does not throw a RegexMatchTimeoutException exception. However, the exception is thrown when an operation is performed on the MatchCollection object returned by this method, if a time-out interval is defined by the 'REGEX_DEFAULT_MATCH_TIMEOUT' property of the current application domain and a matching operation exceeds this time-out interval.

Notes to Callers

This method times out after an interval that is equal to the default time-out value of the application domain in which it is called. If a time-out value has not been defined for the application domain, the value InfiniteMatchTimeout, which prevents the method from timing out, is used. The recommended static method for retrieving multiple pattern matches is Matches(String, String, RegexOptions, TimeSpan), which lets you specify the time-out interval.

See also

Matches(String, String, RegexOptions)Matches(String, String, RegexOptions)Matches(String, String, RegexOptions)Matches(String, String, RegexOptions)

Searches the specified input string for all occurrences of a specified regular expression, using the specified matching options.

Parameters

input: StringStringStringString

The string to search for a match.

pattern: StringStringStringString

The regular expression pattern to match.

options: RegexOptionsRegexOptionsRegexOptionsRegexOptions

A bitwise combination of the enumeration values that specify options for matching.

Returns

MatchCollectionMatchCollectionMatchCollectionMatchCollection

A collection of the Match objects found by the search. If no matches are found, the method returns an empty collection object.

Exceptions

ArgumentExceptionArgumentExceptionArgumentExceptionArgumentException

A regular expression parsing error occurred.

ArgumentNullExceptionArgumentNullExceptionArgumentNullExceptionArgumentNullException

input or pattern is null.

ArgumentOutOfRangeExceptionArgumentOutOfRangeExceptionArgumentOutOfRangeExceptionArgumentOutOfRangeException

options is not a valid bitwise combination of RegexOptions values.

Examples

The following example calls the Matches(String, String) method to identify any word in a sentence that ends in 'es', and then calls the Matches(String, String, RegexOptions) method to perform a case-insensitive comparison of the pattern with the input string. As the output shows, the two methods return different results.

The regular expression pattern bw+esb is defined as shown in the following table.

Pattern	Description
`b`	Begin the match at a word boundary.
`w+`	Match one or more word characters.
`es`	Match the literal string 'es'.
`b`	End the match at a word boundary.

Remarks

The Matches(String, String, RegexOptions) method is similar to the Match(String, String, RegexOptions) method, except that it returns information about all the matches found in the input string, instead of a single match. It is equivalent to the following code:

The static Matches methods are equivalent to constructing a Regex object with the specified regular expression pattern and calling the instance method Matches.

Notes to Callers

See also

Matches(String, String, RegexOptions, TimeSpan)Matches(String, String, RegexOptions, TimeSpan)Matches(String, String, RegexOptions, TimeSpan)Matches(String, String, RegexOptions, TimeSpan)

Searches the specified input string for all occurrences of a specified regular expression, using the specified matching options and time-out interval.

Parameters

input: StringStringStringString

The string to search for a match.

pattern: StringStringStringString

The regular expression pattern to match.

options: RegexOptionsRegexOptionsRegexOptionsRegexOptions

A bitwise combination of the enumeration values that specify options for matching.

matchTimeout: TimeSpanTimeSpanTimeSpanTimeSpan

A time-out interval, or InfiniteMatchTimeout to indicate that the method should not time out.

Returns

MatchCollectionMatchCollectionMatchCollectionMatchCollection

A collection of the Match objects found by the search. If no matches are found, the method returns an empty collection object.

Exceptions

ArgumentExceptionArgumentExceptionArgumentExceptionArgumentException

A regular expression parsing error occurred.

ArgumentNullExceptionArgumentNullExceptionArgumentNullExceptionArgumentNullException

input or pattern is null.

ArgumentOutOfRangeExceptionArgumentOutOfRangeExceptionArgumentOutOfRangeExceptionArgumentOutOfRangeException

options is not a valid bitwise combination of RegexOptions values.

-or-

matchTimeout is negative, zero, or greater than approximately 24 days.

Examples

The following example calls the Matches(String, String, RegexOptions, TimeSpan) method to perform a case-sensitive comparison that matches any word in a sentence that ends in 'es'. It then calls the Matches(String, String, RegexOptions, TimeSpan) method to perform a case-insensitive comparison of the pattern with the input string. In both cases, the time-out interval is set to one second. As the output shows, the two methods return different results.

The regular expression pattern bw+esb is defined as shown in the following table.

Pattern	Description
`b`	Begin the match at a word boundary.
`w+`	Match one or more word characters.
`es`	Match the literal string 'es'.
`b`	End the match at a word boundary.

Remarks

The Matches(String, String, RegexOptions, TimeSpan) method is similar to the Match(String, String, RegexOptions, TimeSpan) method, except that it returns information about all the matches found in the input string, instead of a single match. It is equivalent to the following code:

The static Matches methods are equivalent to constructing a Regex object with the specified regular expression pattern and calling the instance method Matches.

Because of its lazy evaluation, calling the Matches method does not throw a RegexMatchTimeoutException exception. However, an exception is thrown when an operation is performed on the MatchCollection object returned by this method, if a matching operation exceeds this time-out interval specified by thematchTimeout parameter.

Notes to Callers

We recommend that you set the matchTimeout parameter to an appropriate value, such as two seconds. If you disable time-outs by specifying InfiniteMatchTimeout, the regular expression engine offers slightly better performance. However, you should disable time-outs only under the following conditions:

When the input processed by a regular expression is derived from a known and trusted source or consists of static text. This excludes text that has been dynamically input by users.
When the regular expression pattern has been thoroughly tested to ensure that it efficiently handles matches, non-matches, and near matches.
When the regular expression pattern contains no language elements that are known to cause excessive backtracking when processing a near match.

See also

What Are Regular Expressions?

How Are Oracle Database Regular Expressions Useful?

Oracle Database Implementation of Regular Expressions

Oracle Database Support for the POSIX Regular Expression Standard

Regular Expression Metacharacters in Oracle Database

POSIX Metacharacters in Oracle Database Regular Expressions

Regular Expression Operator Multilingual Enhancements

Perl-Influenced Extensions in Oracle Regular Expressions

Using Regular Expressions in SQL Statements: Scenarios

Using an Integrity Constraint to Enforce a Phone Number Format

Using Back References to Reposition Characters

Definition

Overloads

Matches(String)Matches(String)Matches(String)Matches(String)

Parameters

Returns

Exceptions

Examples

Remarks

Matches(String, Int32)Matches(String, Int32)Matches(String, Int32)Matches(String, Int32)

Parameters

Returns

Exceptions

Examples

Remarks

Matches(String, String)Matches(String, String)Matches(String, String)Matches(String, String)

Parameters

Returns

Exceptions

Examples

Remarks

Notes to Callers

Matches(String, String, RegexOptions)Matches(String, String, RegexOptions)Matches(String, String, RegexOptions)Matches(String, String, RegexOptions)

Parameters

Returns

Exceptions

Examples

Remarks

Notes to Callers

Matches(String, String, RegexOptions, TimeSpan)Matches(String, String, RegexOptions, TimeSpan)Matches(String, String, RegexOptions, TimeSpan)Matches(String, String, RegexOptions, TimeSpan)

Parameters

Returns

Exceptions

Examples

Remarks

Notes to Callers

Applies to