Commit 57bfa8b

Add * to the list of functional tags, such as in #1356 (comment)
1 parent c4e2d88 commit 57bfa8b

1 file changed: +5 -1 lines changed

stanza/models/constituency/parse_tree.py

Lines changed: 5 additions & 1 deletion
@@ -18,7 +18,11 @@
 
 EMPTY_CHILDREN = ()
 
-CONSTITUENT_SPLIT = re.compile("[-=#]")
+# used to split off the functional tags from various treebanks
+# for example, the Icelandic treebank (which we don't currently
+# incorporate) uses * to distinguish 'ADJP', 'ADJP*OC' but we treat
+# those as the same
+CONSTITUENT_SPLIT = re.compile("[-=#*]")
 
 # These words occur in the VLSP dataset.
 # The documentation claims there might be *O*, although those don't
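
To illustrate the effect of the new character class, here is a minimal sketch of how a label could be reduced to its base constituent with this pattern. Only `CONSTITUENT_SPLIT` comes from the diff; the helper `base_label` is hypothetical and is not claimed to match how `parse_tree.py` actually applies the regex.

```python
import re

# Pattern from the diff: '-', '=', '#', and now '*' all start a functional tag.
# Inside a character class '*' is a literal, and a leading '-' is a literal too.
CONSTITUENT_SPLIT = re.compile("[-=#*]")

# The pattern before this commit, kept here only for comparison.
OLD_CONSTITUENT_SPLIT = re.compile("[-=#]")


def base_label(label, pattern=CONSTITUENT_SPLIT):
    """Hypothetical helper: keep everything before the first separator."""
    return pattern.split(label, maxsplit=1)[0]


if __name__ == "__main__":
    for label in ("ADJP", "ADJP*OC", "NP-SBJ", "NP=2"):
        print(label, "->", base_label(label))
```

Under the old pattern, `ADJP*OC` would be left unsplit and treated as a label distinct from `ADJP`. Note that this sketch returns an empty string for labels that begin with a separator, such as `-NONE-`, so real code would need to handle those separately.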
