Class RegexBuilder

A wrapper class for JavaScript regular expressions that exposes the utility functions of this library and also provides a DSL for constructing regular expressions.

Index

Methods

and concat derivative enumerate isDisjointFrom isEmpty isEquivalent isSubsetOf isSupersetOf not optional or repeat size toRegExp without

Methods

and

and(re: RegexLike): RegexBuilder
Constructs the intersection of two regex, i.e. a regex that matches exactly the strings matched by both. This is useful for combining several constraints into one, for example to build a single regular expression that validates a new password (see the example below).
Parameters
- re: RegexLike
Returns RegexBuilder
Example
```
const passwordRegex = RB(/.{12,}/) // 12 characters or more
  .and(/[0-9]/) // at least one number
  .and(/[A-Z]/) // at least one upper case letter
  .and(/[a-z]/) // at least one lower case letter
  .toRegExp()

function isValidPassword(str: string) {
  return passwordRegex.test(str)
}
```
In most cases it's simpler and more efficient to match each RegExp individually:
```
function isValidPassword(str: string) {
  return /.{12,}/.test(str) && /[0-9]/.test(str) && /[A-Z]/.test(str) && /[a-z]/.test(str)
}
```
However, this is not always possible, for example when a third-party interface expects a single RegExp as input, such as:
- Express.js - for route parameter matching and path specifications
- Yup/Joi/Zod - for string pattern validation
- Webpack - in various configuration options like test, include, and exclude patterns
- fast-check - for random string generation during fuzzing / property based testing
- Defined in src/index.ts:138
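For comparison, the same four constraints can also be folded into a single native RegExp by hand using lookaheads, which is one way to satisfy an interface that accepts only one pattern. This is a plain JavaScript sketch and not the expression that `.and(...)` generates; the names `passwordLookahead` and `isValidPasswordSplit` are made up for illustration, and the comparison assumes inputs without line breaks.

```typescript
// Plain-JS illustration: each lookahead asserts one constraint,
// then .{12,} consumes the whole string.
const passwordLookahead = /^(?=.*[0-9])(?=.*[A-Z])(?=.*[a-z]).{12,}$/

// Reference check: test each constraint separately.
function isValidPasswordSplit(str: string): boolean {
  return /^.{12,}$/.test(str) && /[0-9]/.test(str) && /[A-Z]/.test(str) && /[a-z]/.test(str)
}

const samples = ["Abcdefghijk1", "abcdefghijk1", "ABCDEFGHIJK1", "Abcdefghijkl", "Ab1"]
for (const s of samples) {
  // Both checks should agree on every sample.
  console.log(s, passwordLookahead.test(s), isValidPasswordSplit(s))
}
```

Hand-written lookaheads get hard to maintain as constraints grow, which is where composing the constraints with `.and(...)` may be more convenient.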

concat

concat(re: RegexLike): RegexBuilder
Concatenates two regex.
Parameters
- re: RegexLike
Returns RegexBuilder
Example
```
RB('aaa').concat('bbb') // like /^aaabbb$/
```
- Defined in src/index.ts:162

derivative

derivative(prefix: string): RegexBuilder
Computes the Brzozowski derivative of the current regex with respect to prefix.

TODO: examples.
Parameters
- prefix: string
Returns RegexBuilder
- Defined in src/index.ts:210
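As a hedged illustration of the underlying idea: the Brzozowski derivative of a language with respect to a prefix keeps the strings that start with that prefix and strips the prefix off. The helper `derivativeOfFiniteLanguage` below is hypothetical and works on an explicit set of strings rather than on `RegexBuilder`; it only shows which strings the result of `derivative(prefix)` would be expected to match under that definition.

```typescript
// Hypothetical helper: derivative of a finite language w.r.t. a prefix.
// D_prefix(L) = { s | prefix + s is in L }
function derivativeOfFiniteLanguage(language: Set<string>, prefix: string): Set<string> {
  const result = new Set<string>()
  for (const word of language) {
    if (word.startsWith(prefix)) {
      result.add(word.slice(prefix.length))
    }
  }
  return result
}

// The language of /^(abc|abd|xyz)$/ written out explicitly.
const language = new Set(["abc", "abd", "xyz"])

console.log(derivativeOfFiniteLanguage(language, "ab")) // contains "c" and "d"
console.log(derivativeOfFiniteLanguage(language, "q"))  // empty set
```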

enumerate

enumerate(): Generator<string, any, any>
A generator function that returns a (potentially infinite) stream of strings that match the given RegExp. This can be useful for testing regular expressions.

Returns Generator<string, any, any>
Example
```
const emailRegex = /^[a-z]+@[a-z]+\.[a-z]{2,}$/

for (const matchedStr of RB(emailRegex).enumerate()) {
  console.log(matchedStr)
}
```
```
a@a.aa
b@a.aa
aa@a.aa
c@a.aa
ba@a.aa
a@b.aa
d@a.aa
ca@a.aa
b@b.aa
ab@a.aa
e@a.aa
da@a.aa
c@b.aa
bb@a.aa
aa@b.aa
f@a.aa
ea@a.aa
d@b.aa
cb@a.aa
ba@b.aa
a@aa.aa
g@a.aa
...
```
Warning
If the regular expression matches infinitely many strings, a loop like the one above won't terminate.

Tip
Use the new Iterator helpers to only get the first N matches, e.g. RB(emailRegex).enumerate().take(100).

The generator produces a fair enumeration. That means every string that matches the regular expression is eventually enumerated. To illustrate, an unfair enumeration of /^(a+|b+)$/ would be:
```
"a", "aa", "aaa", "aaaa", "aaaaa", ...
```
because it never produces any strings of b's. A possible fair enumeration is:
```
"a", "b", "aa", "bb", "aaa", "bbb", "aaaa", "bbbb", ...
```
- Defined in src/index.ts:300
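The fairness idea can be sketched independently of the library by interleaving two infinite generators. This is only an illustration of the concept for /^(a+|b+)$/, not the enumeration algorithm the library uses; `repeats`, `interleave` and `takeFirst` are hypothetical helper names.

```typescript
// Infinite generator of "a", "aa", "aaa", ... for a given character.
function* repeats(char: string): Generator<string> {
  for (let n = 1; ; n++) {
    yield char.repeat(n)
  }
}

// Fair interleaving: alternate between both sources so neither is starved.
function* interleave(left: Generator<string>, right: Generator<string>): Generator<string> {
  while (true) {
    const l = left.next()
    const r = right.next()
    if (l.done && r.done) return
    if (!l.done) yield l.value
    if (!r.done) yield r.value
  }
}

// Collect at most `count` items so that consuming an infinite source terminates.
function takeFirst(source: Generator<string>, count: number): string[] {
  const out: string[] = []
  while (out.length < count) {
    const next = source.next()
    if (next.done) break
    out.push(next.value)
  }
  return out
}

console.log(takeFirst(interleave(repeats("a"), repeats("b")), 6))
// → ["a", "b", "aa", "bb", "aaa", "bbb"]
```

Draining `repeats("a")` before ever touching `repeats("b")` would give the unfair order, since the first source never ends.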

isDisjointFrom

isDisjointFrom(re: RegexLike): boolean
TODO
Parameters
- re: RegexLike
Returns boolean
- Defined in src/index.ts:386

isEmpty

isEmpty(): boolean

Checks if the regex matches no strings at all.

Returns boolean

Example
```
RB('a').isEmpty() // false
RB('').isEmpty() // false
RB('a').and('b').isEmpty() // true
RB(/$.^/).isEmpty() // true
```

isEquivalent

isEquivalent(re: RegexLike): boolean
Checks if two regular expressions are semantically equivalent, i.e. they match the exact same set of strings.
Parameters
- re: RegexLike
Returns boolean
Example
```
RB(/a{1,}/).isEquivalent(/a+/) // true
```
- Defined in src/index.ts:343

isSubsetOf

isSubsetOf(re: RegexLike): boolean
TODO
Parameters
- re: RegexLike
Returns boolean
- Defined in src/index.ts:368

isSupersetOf

isSupersetOf(re: RegexLike): boolean
TODO
Parameters
- re: RegexLike
Returns boolean
- Defined in src/index.ts:377
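The descriptions of isDisjointFrom, isSubsetOf and isSupersetOf are still marked TODO. Assuming they follow the usual set-theoretic reading over the sets of matched strings (an assumption based on the method names and on how isEquivalent is described), the three relations look like this on small explicit sets. The helpers below are hypothetical and operate on `Set<string>`, not on `RegexBuilder`.

```typescript
// Hypothetical helpers on explicit finite languages.

// Every word of `a` is also in `b`.
function isSubset(a: Set<string>, b: Set<string>): boolean {
  for (const word of a) {
    if (!b.has(word)) return false
  }
  return true
}

// `a` contains every word of `b`.
function isSuperset(a: Set<string>, b: Set<string>): boolean {
  return isSubset(b, a)
}

// `a` and `b` share no word.
function isDisjoint(a: Set<string>, b: Set<string>): boolean {
  for (const word of a) {
    if (b.has(word)) return false
  }
  return true
}

const ab = new Set(["a", "b"])       // strings matched by /^[ab]$/
const abc = new Set(["a", "b", "c"]) // strings matched by /^[abc]$/
const digits = new Set(["1", "2"])   // strings matched by /^[12]$/

console.log(isSubset(ab, abc))      // true
console.log(isSuperset(abc, ab))    // true
console.log(isDisjoint(ab, digits)) // true
console.log(isDisjoint(ab, abc))    // false
```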

not

not(): RegexBuilder
Constructs the regex complement, i.e. the regex that matches exactly the strings that the current regex does not match.

TODO: examples.

Returns RegexBuilder
- Defined in src/index.ts:148
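To make the notion of a complement concrete, here is a plain JavaScript sketch that expresses it with a negative lookahead. It is not claimed to be what `not()` produces; `complementOf` is a hypothetical helper, and it assumes `source` is a pattern body written without `^` and `$` anchors.

```typescript
// Hypothetical helper: complement of an anchored pattern via negative lookahead.
// The lookahead rejects the input if the whole string matches `source`;
// otherwise [\s\S]* accepts any string, including line breaks.
function complementOf(source: string): RegExp {
  return new RegExp(`^(?!(?:${source})$)[\\s\\S]*$`)
}

const notAbc = complementOf("abc")
console.log(notAbc.test("abc"))  // false
console.log(notAbc.test("abcd")) // true
console.log(notAbc.test(""))     // true
```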

optional

optional(): RegexBuilder
This is like the ? postfix operator.

Returns RegexBuilder
Example
```
RB('a').optional() // like /^a?$/
```
- Defined in src/index.ts:199

or

or(re: RegexLike): RegexBuilder
This is like the regex pipe operator | (aka. alternation, aka. union, aka. or).
Parameters
- re: RegexLike
Returns RegexBuilder
Example
```
RB('a').or('b') // like /^(a|b)$/
```
- Defined in src/index.ts:99

repeat

repeat(bounds?: RepeatBounds): RegexBuilder

Constructs quantified regular expressions, subsuming all these regex operators: *, +, {n,m}, ?.

Parameters

- bounds: RepeatBounds = ...

Returns RegexBuilder

Example
```
RB('a').repeat(4) // a{4}
RB('a').repeat({ min: 3, max: 5 }) // a{3,5}
RB('a').repeat({ max: 5 }) // a{0,5}
RB('a').repeat({ min: 3 }) // a{3,}
RB('a').repeat({ min: 0, max: 1 }) // a?
RB('a').repeat({ min: 0 }) // a*
RB('a').repeat() // a*
```

size

size(): undefined | bigint
Returns the number of strings that match the regex or undefined if there are infinitely many matches.

Returns undefined | bigint
Example
```
RB(/^[a-z]$/).size() === 26n

RB(/^[a-z][0-9]$/).size() === 260n

// this one has infinitely many matches:
RB(/^[a-z]*$/).size() === undefined

// counts can get very large, which is why the return type is `bigint`:
RB(/^[a-z]{60}/).size() === 7914088058189701615326255069116716194962212229317838559326167922356251403772678373376n
```
Note
Double counting is avoided in many cases. For example, RB(/^(hello|hello)$/).size() is 1n and not 2n. It can probably still happen for some expressions, so the result may overcount, but it should always be an upper bound.
- Defined in src/index.ts:238
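The counts in the example follow from simple multiplication: independent positions multiply their number of options. The sketch below reproduces that arithmetic with native `bigint` values and shows why a plain `number` would not be precise enough; it does not call the library, and the variable names are illustrative only.

```typescript
// [a-z] offers 26 options and [0-9] offers 10; independent positions multiply.
const oneLetter = 26n
const letterThenDigit = 26n * 10n

// 60 letters in a row: 26^60, far beyond what a double can represent exactly.
const sixtyLetters = 26n ** 60n

console.log(oneLetter, letterThenDigit)                     // 26n 260n
console.log(sixtyLetters.toString().length)                 // → 85 (decimal digits)
console.log(sixtyLetters > BigInt(Number.MAX_SAFE_INTEGER)) // → true
```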

toRegExp

toRegExp(): RegExp
Converts back to a native JavaScript RegExp.

Returns RegExp
Warning
The generated RegExp can be very large if it was constructed with .and(...) or .not().
- Defined in src/index.ts:311

without

without(re: RegexLike): RegexBuilder
Constructs the difference of the current regex and re. That is, this returns a new regex which matches all strings that the current regex matches EXCEPT everything that re matches.
Parameters
- re: RegexLike
Returns RegexBuilder
Example
```
RB(/^a*$/).without(/^a{5}$/) // /^(a{0,4}|a{6,})$/
```
- Defined in src/index.ts:359
