We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent c4e2d88 commit 57bfa8bCopy full SHA for 57bfa8b
stanza/models/constituency/parse_tree.py
@@ -18,7 +18,11 @@
18
19
EMPTY_CHILDREN = ()
20
21
-CONSTITUENT_SPLIT = re.compile("[-=#]")
+# used to split off the functional tags from various treebanks
22
+# for example, the Icelandic treebank (which we don't currently
23
+# incorporate) uses * to distinguish 'ADJP', 'ADJP*OC' but we treat
24
+# those as the same
25
+CONSTITUENT_SPLIT = re.compile("[-=#*]")
26
27
# These words occur in the VLSP dataset.
28
# The documentation claims there might be *O*, although those don't
0 commit comments