‘SCF Extractor (IT)’ is a service that performs inductive subcategorisation extraction from dependency parsed texts, formatted according to the CoNLL-X format. This tool/service was optimized for Italian.
The SCF Extractor tool was developed at CNR-ILC and deployed as a soap web service within the EU-FP7-STREP PANACEA project (www.panacea-lr.eu).
The service requires 2 input data: 1) a dependency parsed text corpus the CONLL-X format; 2) a list of verb lemmas for which the subcategorization frames will be extracted.
For details on the tool design please see:
– Rimell, Laura, Núria Bel, Muntsa Padró, Francesca Frontini, Monica Monachini and Valeria Quochi. 2012. D6.2 Integrated Final Version of the Components for Lexical Acquisition. Final Project Report. EC/FP7/248064. PANACEA project.
– Caselli, Tommaso; Frontini, Francesca; Quochi, Valeria; Rubino, Francesco and Russo, Irene. (2012). Customizable SCF Acquisition in Italian.In Proceedings of LREC 2012, Istanbul, Turkey.
URL: SCF Extractor (WSDL)