PET

PET: A new Dataset for Process Extraction from Natural Language Text

Texts | Introduced 2022-03-09

The dataset contains 45 documents, each a narrative description of a business process, together with their annotations. The documents are annotated with activities, gateways, actors, and flow information.

Each document is composed of three files:

Doc_name.txt (Process description in CoNLL format)

Doc_name.process-elements.IOB2.txt (Process elements annotated with the IOB2 schema, in CoNLL format)

Doc_name.relations.tsv (Relations between process elements. Each line is a triple (source, relation tag, target). Source and target are given in the form: n_sent_x words range.)
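The three-file layout above can be read with a few lines of Python. The following is a minimal sketch, not the dataset's official loader. It assumes that the CoNLL files hold one token per line in whitespace-separated columns, with the token first and the IOB2 tag last, and that blank lines separate sentences. It also assumes that the relations file has exactly three tab-separated fields per line. The helper names (read_conll, iob2_spans, read_relations) are hypothetical, and source and target are kept as opaque strings because their exact syntax is not spelled out here.

```python
from typing import Iterable, List, Tuple

Span = Tuple[str, int, int]  # (label, start token index, end token index exclusive)


def read_conll(lines: Iterable[str]) -> List[List[Tuple[str, str]]]:
    """Group (token, tag) pairs into sentences.

    Assumption: token is the first column, tag is the last column,
    and an empty line ends a sentence.
    """
    sentences, current = [], []
    for line in lines:
        cols = line.split()
        if not cols:
            if current:
                sentences.append(current)
                current = []
            continue
        current.append((cols[0], cols[-1]))
    if current:
        sentences.append(current)
    return sentences


def iob2_spans(tags: List[str]) -> List[Span]:
    """Decode one sentence's IOB2 tags into labelled token spans.

    In IOB2 every span starts with B-<label>; I-<label> continues it;
    O is outside any span. Stray I- tags without a preceding B- are ignored.
    """
    spans: List[Span] = []
    label, start = None, 0
    for i, tag in enumerate(tags + ["O"]):  # sentinel closes a trailing span
        inside = tag.startswith("I-") and tag[2:] == label
        if label is not None and not inside:
            spans.append((label, start, i))
            label = None
        if tag.startswith("B-"):
            label, start = tag[2:], i
    return spans


def read_relations(lines: Iterable[str]) -> List[Tuple[str, str, str]]:
    """Parse (source, relation tag, target) triples from tab-separated lines.

    Lines that do not have exactly three fields are skipped.
    """
    triples = []
    for line in lines:
        fields = line.rstrip("\n").split("\t")
        if len(fields) == 3:
            triples.append((fields[0], fields[1], fields[2]))
    return triples
```

To use it, open Doc_name.process-elements.IOB2.txt and pass the file object to read_conll, then call iob2_spans on the tag column of each sentence. Open Doc_name.relations.tsv and pass it to read_relations to get the triples. If the actual column order differs from the assumption above, only the indexing in read_conll needs to change.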