Optimizations for `re` filter #355

In #348, we decided to use anchored regular expressions by default.
This change was necessary to enable several optimizations for regular expression matching.

### Prefix and Suffix Literals

Before matching a field's tokens against the regular expression from a search query, we can extract two literals from the expression: a prefix and a suffix. Because the expression is anchored, every matching token must start with the prefix and end with the suffix, which enables two optimizations:
- Reduce the number of token blocks to examine by performing a binary search over token blocks using the `(strings|bytes).HasPrefix()` function (as we already do for [literals](https://github.com/ozontech/seq-db/blob/158eee638dc3dd658bb9248c7ee094a8ad7eb3b1/pattern/pattern.go#L341-L343));
- For each remaining token, check `(strings|bytes).HasSuffix()` before running the regular expression NFA, so tokens that cannot match skip the expensive step.
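
To make the idea concrete, here is a minimal sketch of both steps using the standard `regexp/syntax` package. It is not seq-db code: `literalEnds`, `literal`, and `search` are hypothetical helpers, and a plain sorted `[]string` stands in for the real token blocks. It only extracts literals from the outermost concatenation and skips case-insensitive literals, so it is deliberately conservative.

```go
package main

import (
	"fmt"
	"regexp"
	"regexp/syntax"
	"sort"
	"strings"
)

// literalEnds returns literals that every full (anchored) match of pattern
// must start and end with. An empty string means nothing was extracted.
// Hypothetical helper for illustration; it is not part of seq-db.
func literalEnds(pattern string) (prefix, suffix string, err error) {
	re, err := syntax.Parse(pattern, syntax.Perl)
	if err != nil {
		return "", "", err
	}
	re = re.Simplify()
	switch re.Op {
	case syntax.OpLiteral:
		prefix = literal(re)
		suffix = prefix
	case syntax.OpConcat:
		prefix = literal(re.Sub[0])
		suffix = literal(re.Sub[len(re.Sub)-1])
	}
	return prefix, suffix, nil
}

// literal returns the text of a case-sensitive literal node, or "".
func literal(re *syntax.Regexp) string {
	if re.Op != syntax.OpLiteral || re.Flags&syntax.FoldCase != 0 {
		return ""
	}
	return string(re.Rune)
}

// search returns the tokens fully matched by pattern. The tokens slice must
// be sorted; it is an assumed stand-in for seq-db token blocks.
func search(tokens []string, pattern string) ([]string, error) {
	prefix, suffix, err := literalEnds(pattern)
	if err != nil {
		return nil, err
	}
	re, err := regexp.Compile(`^(?:` + pattern + `)$`)
	if err != nil {
		return nil, err
	}
	// Tokens sharing a prefix are contiguous in sorted order, so two binary
	// searches bound the candidate range.
	lo := sort.SearchStrings(tokens, prefix)
	rest := tokens[lo:]
	hi := sort.Search(len(rest), func(i int) bool {
		return !strings.HasPrefix(rest[i], prefix)
	})
	var out []string
	for _, tok := range rest[:hi] {
		// Cheap suffix check first; the regexp runs only on survivors.
		if strings.HasSuffix(tok, suffix) && re.MatchString(tok) {
			out = append(out, tok)
		}
	}
	return out, nil
}

func main() {
	tokens := []string{"api-1-web", "pod-1-db", "pod-1-web", "pod-22-web", "pod-web", "zeta"}
	matches, err := search(tokens, `pod-.*-web`)
	if err != nil {
		panic(err)
	}
	fmt.Println(matches)
}
```

Note that the literal checks are only a prefilter: `pod-web` passes both `HasPrefix("pod-")` and `HasSuffix("-web")` but is still rejected by the regular expression, so the final match cannot be dropped.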

### Set Lookups

Sometimes we can transform a regular expression into a basic search query. For example, consider this query using the `re` filter: `k8s_pod:re("pod-1|pod-2")`. It is easy to see that we can rewrite it as `k8s_pod:'pod-1' OR k8s_pod:'pod-2'`.
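
One possible way to detect such cases is to check whether the expression matches only a small finite set of strings and, if so, enumerate them. The sketch below does this with `regexp/syntax`; `expand`, `rewriteAsOr`, and the `maxTerms` cap are assumptions made for illustration, not existing seq-db code. It walks the parsed tree rather than splitting on `|`, because the Go parser factors common prefixes out of alternations. The query is built by plain string concatenation and does not escape quotes inside terms.

```go
package main

import (
	"fmt"
	"regexp/syntax"
	"strings"
)

// maxTerms caps the size of the rewritten query (an arbitrary assumed limit).
const maxTerms = 16

// expand lists every string matched by re, or reports false when the
// language is unbounded, too large, or uses an unsupported construct.
func expand(re *syntax.Regexp) ([]string, bool) {
	switch re.Op {
	case syntax.OpEmptyMatch:
		return []string{""}, true
	case syntax.OpLiteral:
		if re.Flags&syntax.FoldCase != 0 {
			return nil, false
		}
		return []string{string(re.Rune)}, true
	case syntax.OpCharClass:
		// Rune holds inclusive [lo, hi] pairs.
		var out []string
		for i := 0; i+1 < len(re.Rune); i += 2 {
			for r := re.Rune[i]; r <= re.Rune[i+1]; r++ {
				if len(out) == maxTerms {
					return nil, false
				}
				out = append(out, string(r))
			}
		}
		return out, len(out) > 0
	case syntax.OpCapture:
		return expand(re.Sub[0])
	case syntax.OpAlternate:
		var out []string
		for _, sub := range re.Sub {
			terms, ok := expand(sub)
			if !ok || len(out)+len(terms) > maxTerms {
				return nil, false
			}
			out = append(out, terms...)
		}
		return out, true
	case syntax.OpConcat:
		// Cartesian product of the alternatives of each part.
		out := []string{""}
		for _, sub := range re.Sub {
			terms, ok := expand(sub)
			if !ok || len(out)*len(terms) > maxTerms {
				return nil, false
			}
			next := make([]string, 0, len(out)*len(terms))
			for _, head := range out {
				for _, tail := range terms {
					next = append(next, head+tail)
				}
			}
			out = next
		}
		return out, true
	}
	return nil, false
}

// rewriteAsOr turns field:re("pattern") into an OR of exact-match terms
// when the pattern is a small finite set; otherwise it reports false.
func rewriteAsOr(field, pattern string) (string, bool) {
	re, err := syntax.Parse(pattern, syntax.Perl)
	if err != nil {
		return "", false
	}
	terms, ok := expand(re.Simplify())
	if !ok {
		return "", false
	}
	parts := make([]string, len(terms))
	for i, t := range terms {
		parts[i] = field + ":'" + t + "'" // no escaping: sketch only
	}
	return strings.Join(parts, " OR "), true
}

func main() {
	query, ok := rewriteAsOr("k8s_pod", "pod-1|pod-2")
	fmt.Println(query, ok)
}
```

Patterns such as `pod-.*` are reported as not rewritable, so the caller would fall back to the regular `re` matching path.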

I am pretty sure there are many more optimizations we can implement, so this is a topic worth researching further.


