entity-fishing: a DARIAH entity recognition and disambiguation service
- Published: 19 Nov 2020
- Publisher: HAL CCSD
- Country: France
- Link this publication to...
- Cite this publication
Add to ORCID
Please grant OpenAIRE to access and update your ORCID works.This research outcome is the result of merged research outcomes in OpenAIRE.
You have already added 0 works in your ORCID record related to the merged research outcome.add annotation
- Inria research centre Lille - Nord Europe France
- Romark (United States) United States
- Immunology, inflammation, infectiology and microbiology France

- DARIAH EU
- Funder: European Commission (EC)
- Project Code: 731102
- Funding stream: H2020 | RIA
- 1
- 2
Brando, Carmen, Francesca Frontini, and Jean-Gabriel Ganascia. 2016. “REDEN: Named Entity Linking in Digital Literary Editions Using Linked Data Sets.” Complex Systems Informatics and Modeling Quarterly, no. 7: 60-80. doi:10.7250/csimq.2016- 7.04. [OpenAIRE]
Buddenbohm, Stefan, and Raisa Barthauer. 2017. “D 4.1 - Gap Analysis of DARIAH Research Infrastructure.” DARIAH research report.. https://hal.archivesouvertes.fr/hal-01663594.
Cucerzan, Silviu. 2007. “Large-Scale Named Entity Disambiguation Based on Wikipedia Data.” In Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLPCoNLL), 708-16. Stroudsburg, PA: Association for Computational Linguistics. https://www.aclweb.org/anthology/volumes/D07-1/.
Edwards, Paul N. 2003. “Infrastructure and Modernity: Force, Time, and Social Organization in the History of Sociotechnical Systems.” In Modernity and Technology, edited by Thomas J. Misa, Philip Brey, and Andrew Feenberg, 185-225. Cambridge, MA: MIT Press.
Lopez, Patrice. 2009. “GROBID: Combining Automatic Bibliographic Data Recognition and Term Extraction for Scholarship Publications.” In Research and Advanced Technology for Digital Libraries: 13th European Conference, ECDL 2009…: Proceedings, edited by Maristella Agosti, José Borbinha, Sarantos Kapidakis, Christos Papatheodorou, and Giannis Tsakonas, 473-74. Lecture Notes in Computer Science 5714. Berlin, Heidelberg: Springer.
Lopez, Patrice. 2017. “Entity-Fishing.” Slides presented at WikiDataCon 2017, Berlin, Germany, October 28-29. Last revised 8 February 2018, https://www.wikidata.org/wiki/Wikidata:WikidataCon_2017/Documentation; accessed July 11, 2020, https://grobid.s3.amazonaws.com/presentations/29-10- 2017.pdf.
Lopez, Patrice, Alexander Meyer, and Laurent Romary. 2014. “CENDARI Virtual Research Environment & Named Entity Recognition Techniques.” Poster presented at the conference Grenzen überschreiten - Digitale Geisteswissenschaft heute und morgen, Berlin, Germany, February 28, 2014. Einstein-Zirkel Digital Humanities. https://hal.inria.fr/hal-01577975.
Milne, David N., Ian H. Witten, and David M. Nichols. 2007. “Extracting Corpus Specific Knowledge Bases from Wikipedia.” Working paper series, no. 03/2007, Department of Computer Science, University of Waikato, Hamilton, New Zealand. https://hdl.handle.net/10289/69.
Nadeau, David, and Satoshi Sekine. 2007. “A Survey of Named Entity Recognition and Classification.” In Named Entities: Recognition, Classification and Use, edited by Satoshi Sekine and Elisabete Ranchhod [Lingvisticae Investigationes 30:1], 3-26. [Amsterdam and Philadelphia]: John Benjamins. doi:10.1075/li.30.1.03nad.
Pellissier Tanon, Thomas, Denny Vrandečić, Sebastian Schaffert, Thomas Steiner, and Lydia Pintscher. 2016. “From Freebase to Wikidata: The Great Migration.” In WWW '16: Proceedings of the 25th International Conference on World Wide Web, 1419-28. Geneva, Switzerland: International World Wide Web Conferences Steering Committee. doi:10.1145/2872427.2874809.
Ratinov, Lev, Dan Roth, Doug Downey, and Mike Anderson. 2011. “Local and Global Algorithms for Disambiguation to Wikipedia.” In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 1:1375-84. Stroudsburg, PA: Association for Computational Linguistics. https://www.aclweb.org/anthology/P11-1138/.
Romary, Laurent, and Jennifer Edmond. 2017. “Sustainability in DARIAH.” Presentation at Sustainability of Digital Research Infrastructures for the Arts and Humanities (Workshop at the DARIAH Annual Event), Berlin, Germany, April 27. https://hal.inria.fr/hal-01516487.
Salton, Gerard, and Michael J. McGill. 1983. Introduction to Modern Information Retrieval. New York: McGraw-Hill.
Smith, David A., and Gregory Crane. 2001. “Disambiguating Geographic Names in a Historical Digital Library.” In Research and Advanced Technology for Digital Libraries: 5th European conference, ECDL 2001…: Proceedings, edited by Panos Constantopoulos and Ingeborg T. Sølvberg, 127-36. Lecture Notes in Computer Science 2163. Berlin: Springer.
Steiner, Thomas. 2014. “Bots vs. Wikipedians, Anons vs. Logged-Ins (Redux): A Global Study of Edit Activity on Wikipedia and Wikidata.” In OpenSym '14: Proceedings of the International Symposium on Open Collaboration, 25:1-25:7. New York: ACM. doi:10.1145/2641580.2641613.
- 1
- 2
- 2018 . IsAmongTopNSimilarDocuments
- Inria research centre Lille - Nord Europe France
- Romark (United States) United States
- Immunology, inflammation, infectiology and microbiology France

- DARIAH EU
- Funder: European Commission (EC)
- Project Code: 731102
- Funding stream: H2020 | RIA
- 1
- 2
Brando, Carmen, Francesca Frontini, and Jean-Gabriel Ganascia. 2016. “REDEN: Named Entity Linking in Digital Literary Editions Using Linked Data Sets.” Complex Systems Informatics and Modeling Quarterly, no. 7: 60-80. doi:10.7250/csimq.2016- 7.04. [OpenAIRE]
Buddenbohm, Stefan, and Raisa Barthauer. 2017. “D 4.1 - Gap Analysis of DARIAH Research Infrastructure.” DARIAH research report.. https://hal.archivesouvertes.fr/hal-01663594.
Cucerzan, Silviu. 2007. “Large-Scale Named Entity Disambiguation Based on Wikipedia Data.” In Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLPCoNLL), 708-16. Stroudsburg, PA: Association for Computational Linguistics. https://www.aclweb.org/anthology/volumes/D07-1/.
Edwards, Paul N. 2003. “Infrastructure and Modernity: Force, Time, and Social Organization in the History of Sociotechnical Systems.” In Modernity and Technology, edited by Thomas J. Misa, Philip Brey, and Andrew Feenberg, 185-225. Cambridge, MA: MIT Press.
Lopez, Patrice. 2009. “GROBID: Combining Automatic Bibliographic Data Recognition and Term Extraction for Scholarship Publications.” In Research and Advanced Technology for Digital Libraries: 13th European Conference, ECDL 2009…: Proceedings, edited by Maristella Agosti, José Borbinha, Sarantos Kapidakis, Christos Papatheodorou, and Giannis Tsakonas, 473-74. Lecture Notes in Computer Science 5714. Berlin, Heidelberg: Springer.
Lopez, Patrice. 2017. “Entity-Fishing.” Slides presented at WikiDataCon 2017, Berlin, Germany, October 28-29. Last revised 8 February 2018, https://www.wikidata.org/wiki/Wikidata:WikidataCon_2017/Documentation; accessed July 11, 2020, https://grobid.s3.amazonaws.com/presentations/29-10- 2017.pdf.
Lopez, Patrice, Alexander Meyer, and Laurent Romary. 2014. “CENDARI Virtual Research Environment & Named Entity Recognition Techniques.” Poster presented at the conference Grenzen überschreiten - Digitale Geisteswissenschaft heute und morgen, Berlin, Germany, February 28, 2014. Einstein-Zirkel Digital Humanities. https://hal.inria.fr/hal-01577975.
Milne, David N., Ian H. Witten, and David M. Nichols. 2007. “Extracting Corpus Specific Knowledge Bases from Wikipedia.” Working paper series, no. 03/2007, Department of Computer Science, University of Waikato, Hamilton, New Zealand. https://hdl.handle.net/10289/69.
Nadeau, David, and Satoshi Sekine. 2007. “A Survey of Named Entity Recognition and Classification.” In Named Entities: Recognition, Classification and Use, edited by Satoshi Sekine and Elisabete Ranchhod [Lingvisticae Investigationes 30:1], 3-26. [Amsterdam and Philadelphia]: John Benjamins. doi:10.1075/li.30.1.03nad.
Pellissier Tanon, Thomas, Denny Vrandečić, Sebastian Schaffert, Thomas Steiner, and Lydia Pintscher. 2016. “From Freebase to Wikidata: The Great Migration.” In WWW '16: Proceedings of the 25th International Conference on World Wide Web, 1419-28. Geneva, Switzerland: International World Wide Web Conferences Steering Committee. doi:10.1145/2872427.2874809.
Ratinov, Lev, Dan Roth, Doug Downey, and Mike Anderson. 2011. “Local and Global Algorithms for Disambiguation to Wikipedia.” In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 1:1375-84. Stroudsburg, PA: Association for Computational Linguistics. https://www.aclweb.org/anthology/P11-1138/.
Romary, Laurent, and Jennifer Edmond. 2017. “Sustainability in DARIAH.” Presentation at Sustainability of Digital Research Infrastructures for the Arts and Humanities (Workshop at the DARIAH Annual Event), Berlin, Germany, April 27. https://hal.inria.fr/hal-01516487.
Salton, Gerard, and Michael J. McGill. 1983. Introduction to Modern Information Retrieval. New York: McGraw-Hill.
Smith, David A., and Gregory Crane. 2001. “Disambiguating Geographic Names in a Historical Digital Library.” In Research and Advanced Technology for Digital Libraries: 5th European conference, ECDL 2001…: Proceedings, edited by Panos Constantopoulos and Ingeborg T. Sølvberg, 127-36. Lecture Notes in Computer Science 2163. Berlin: Springer.
Steiner, Thomas. 2014. “Bots vs. Wikipedians, Anons vs. Logged-Ins (Redux): A Global Study of Edit Activity on Wikipedia and Wikidata.” In OpenSym '14: Proceedings of the International Symposium on Open Collaboration, 25:1-25:7. New York: ACM. doi:10.1145/2641580.2641613.
- 1
- 2
- 2018 . IsAmongTopNSimilarDocuments