An Extensible Schema For Building Large Weakly-labeled Semantic Corpora

س.ماثيو اينجليش

doi:10.61850/allj.v22i2.368

pdf (English)

منشور: May 30, 2016

DOI: https://doi.org/10.61850/allj.v22i2.368

الكلمات المفتاحية:

ويكي بيانات -المجمعات الدلالية -

س.ماثيو اينجليش

جامعة هونغ كونغ

الملخص

في البرمجة اللغوية العصبية، تقود البيانات البحث، كما يتضح من وتيرة استخدام الأعمال الأساسية لهندسة قواعد البيانات مثل The Penn Treebank كأساس للتجريب. تعد المجموعات المشروحة بخبرة واسعة النطاق تقليديًا باهظة الثمن وتستغرق وقتًا طويلاً في الإنتاج. دفع هذا النموذج الباحثين إلى اعتماد أساليب آلية لتوليد البيانات المصنفة باستخدام الأدوات المتاحة مثل Freebase وDBpedia و"صناديق المعلومات" الموجودة على صفحات ويكيبيديا. تم إدراج قواعد المعرفة هذه، أو هي في طور الدمج، في ويكي بيانات، وهي مبادرة لتركيز مستودعات البيانات المتباينة في تنسيق منظم يمكن قراءته آليًا. هذا المورد هو أداة بحث مهمة. في هذه الورقة، نراجع تجربتنا في استخدام ويكي بيانات في إنشاء مجموعة كبيرة مشروحة تحت إشراف بعيد، علاوة على ذلك، نجعل المواد، والكود المستخدم لإنشاء تعليقاتنا التوضيحية، متاحة مجانًا لجميع الأطراف المعنية.

Plum Analytics

Artifact Widget

كيفية الاقتباس

اينجليشس. (2016). مخطط قابل للتوسيع لبناء مجموعات دلالية كبيرة ذات علامات ضعيفة. AL-Lisaniyyat, 22(2), 18-22. https://doi.org/10.61850/allj.v22i2.368

إصدار

مجلد 22 عدد 2 (2016): v22i22016

القسم

Articles

وفقًا لسياسة النشر المعتمدة في الوصول الحر، تقرّ مجلة السانيات وتضمن احتفاظ المؤلفين بالملكية الكاملة والحصرية لحقوق التأليف وحقوق الملكية الفكرية المتعلقة بإسهاماتهم العلميةولا يترتب على نشر المقال في المجلة أي نقل أو تنازل أو تقييد لهذه الحقوق. ويظل للمؤلفين الحق الكامل في أعمالهم دون الحاجة إلى الحصول على إذن كتابي مسبق من المجلة

المراجع

Abad, azad, and alessandro Moschitti.2014.creating a standard for evaluating distant supervision for relation extraction.Italian conference on computational linguistics CLIC-IT.1.
Intxaurrondo, Ander, Mihai surdeanu, oier Loêz de lacalle, and Eneko agirre.2013.Removing noisy mentions for distant suoervision. Procesamiento del lenguaje natural 51.,41-48.
Hoffmann, Raphael, congle zhang, xiao ling, luke Zettlemoyer , and Daniel S.weld. 2011.Knowledge-based weak supervision for information extraction of overlapping relations.
Association for computational linguistics.in proceedings of the 49th annual meeting of the association for computational linguistics: human language technologies-volume 1.541-550.
Manning, Christopher D. and surdeanu, Mihai and bauer, John and Finkel, Jenny and Bethard , steven J.and L+MC Closky , David. 2014. The Stanford core NLP Natural Language Processing Toolkit. Proceedings of 52nd annual meeting of the Association for computational linguistics: System Demonstrations.55-60.
Marcus, Mitchell p.,Mary Ann Marcinkiewicz, and Beatrice Santorini.1993.Building a large annotated corpus of English: the penn Treebank. Cambridge university press, Cambridge, UK.Camputational linguistics 19.2,313-330.
Mintz,Mike , steven Bills, Rion Snow, and dan jurafsky.2009. Distant Supevision for relation extraction without labeled data. Association for computational Linguistics In proceeding of the joint conference of the 47th annual meeting of the ACL and the 4th international joint conference on natural language processing of the AFNLP.Volume2.
Riedel, Sebastian , Limin Tao , and Andrew Mc callum.2010. Modeling relations and their mentions without labeled text. Springer Berlin Heidelberg.Machine learning and knowledge discovery in databases. 148-163.
Riedel, Sebastian , Limin Tao , and Andrew Mc callum, and Benjamin M.Marlin. 2013. In Naacl-hlt.Linguistic data consortium, Philadephia.74-84.
Sandhaus, Evan. 2008. The new York times annotated corpus.Linguistic Data Consortium, Philadelphia.
Schoenmackers, Stefan, Oren Etzioni, Daniel S.Weld, and jesse davis.2012. Learning first-order horn clauses from web text.Association for computational linguistics.In proceedings of the 2010 conference on empirical methods in natural language processing.1088-1098.
Surdeanu , Mihai , Julie Tibshirani, Ramesh Nallapati, and Christopher D.Manning.2012.Multi-instance multi-label learning for relation extraction.Association for computational linguistics.in proceedings of the 2012 joint conference on empirical methods in natural language processing and computational natural language learning .455-465.
Vrandecic, Denny , and Markus Krotzsh.2014.Wikidata : Afree collaborative knowledgebase..comunications of the ACM57,no.10.78-85.
Erxleben, Fredo, Michael Gunther,Markus Krotzsch, Julian Mendez, and Denny Vrandecic.2014.Introduction wikidata to the linked data web.springer international publishing in the semantic web-iswcx2014.50-65.

##plugins.themes.bootstrap3.article.sidebar##

##plugins.themes.bootstrap3.article.main##

الملخص

##plugins.themes.bootstrap3.article.details##

المراجع