2 results
Johannes Doleschal ; Benny Kimelfeld ; Wim Martens ; Liat Peterfreund.
The framework of document spanners abstracts the task of information extraction from text as a function that maps every document (a string) into a relation over the document's spans (intervals identified by their start and end indices). For instance, the regular spanners are the closure under the […]
Published on January 31, 2022
Johannes Doleschal ; Benny Kimelfeld ; Wim Martens.
Regular expressions with capture variables, also known as regex-formulas, extract relations of spans (intervals identified by their start and end indices) from text. In turn, the class of regular document spanners is the closure of the regex formulas under the Relational Algebra. We investigate the […]
Published on August 9, 2023