Hi all, we would like to organize a hackathon during ESWC.
We want to hold it on Monday, May 27 in Montpellier, France.
We will probably use https://www.hackerleague.org/ to manage signups.
Theme:
The ability to extract meaningful, machine-interpretable data from
scholarly publications in PDF form is a big challenge. Several open
source libraries exist that attempt to automate this process, but work
needs to be done on them to improve accuracy and reliability. Some
specific and relevant challenges include:
- Ability to automatically identify and tokenize citations from the PDF
  (or more accurately, from a string of text).
- Ability to automatically identify those blocks of text that represent
  the narrative in a PDF.
- Ability to identify references within the narrative, extract their
  scope, and associate them with citation information in the PDF.
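
To make the first challenge concrete, here is a minimal Python sketch of tokenizing a citation from a string of text. It is only an illustration, not how any of the existing libraries work: the pattern assumes a single made-up layout, "Authors (Year). Title. Venue.", and the names CITATION_RE and tokenize_citation as well as the sample citation are invented for this example. Real reference lists vary far more than this, which is exactly why the problem is hard.

```python
import re

# Assumed layout (hypothetical, one style only): "Authors (Year). Title. Venue."
CITATION_RE = re.compile(
    r"^(?P<authors>.+?)\s+\((?P<year>\d{4})\)\.\s+"  # authors, then "(YYYY)."
    r"(?P<title>[^.]+)\.\s+"                          # title up to the next period
    r"(?P<venue>.+?)\.?$"                             # the rest, minus a trailing period
)


def tokenize_citation(text):
    """Split a citation string into labeled fields, or return None if it does not match."""
    match = CITATION_RE.match(text.strip())
    if match is None:
        return None
    return {name: value.strip() for name, value in match.groupdict().items()}


# Invented placeholder citation, not a real publication.
example = (
    "Smith, J. and Doe, A. (2012). Parsing references at scale. "
    "Journal of Examples, 3(1), 1-10."
)
fields = tokenize_citation(example)
for name, value in fields.items():
    print(f"{name}: {value}")
```

A single regular expression like this breaks as soon as the title contains a period or the year moves to the end, so it is best read as a baseline to improve on during the hackathon.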
Anybody interested is welcome to join us; we will announce more
precise details later this week. We are also looking for someone to
co-organize the meeting, ideally someone who is local to Montpellier
or to France.
Best.
Hackathon, extracting meaningful, machine-interpretable data from scholarly publications