Skip to content

System-T/CrowdsourcingSRL

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Crowdsourcing SRL

This folder contains the data collected for:

  • A Novel Workflow for Accurately and Efficiently CrowdsourcingPredicate Senses and Argument Labels Youxuan Jiang, Huaiyu Zhu, Jonathan K. Kummerfeld, Yunyao Li, Walter Lasecki Findings of EMNLP, 2020

This data is primarily of value for analysing the behaviour of our annotators. For gold SRL data, see the complete CoNLL 2009 dataset.

If you use the data please cite our work:

@InProceedings{emnlp-findings20srl,
  title     = {A Novel Workflow for Accurately and Efficiently Crowdsourcing Predicate Senses and Argument Labels},
  author    = {Youxuan Jiang and Huaiyu Zhu and Jonathan K. Kummerfeld and Yunyao Li and Walter Lasecki},
  booktitle = {Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing},
  month     = {November},
  year      = {2020},
  location  = {Online},
  pages     = {},
  url       = {},
  software  = {https://github.com/System-T/CrowdsourcingSRL},
}

predicates-full.txt and arguments-full.txt

These are the complete contents of the final_task_results table for the word sense disambiguation task (predicates) and the argument labeling task (arguments). They were extracted using the following commands:

SELECT * FROM final_task_results WHERE dataset = "WSD_large_scale"
SELECT * FROM final_task_results WHERE dataset = "SRL_large_scale"

The columns are:

  • id, A unique ID for this row
  • sen_id, The sentence ID
  • va_id,
  • ori_gold, The original gold label
  • final_gold, A modified gold label based on errors found during our work
  • pred, The predicted label
  • ex_condition, The experimental condition
  • worker_decision, Where in the workflow the label was finalised
  • final_label, The final label chosen
  • dataset, Always 'WSD_large_scale'

srl-db.sql

This is the complete database of experimental results. It contains the following tables:

  • argument
  • final_task_results
  • frame
  • gold_argument
  • gold_verb
  • result
  • select_result
  • sentence
  • task
  • task_type
  • verb

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published