University of La Rochelle

ULR

The University of La Rochelle is a French higher education and research institution founded in 1993. Within this the L3i (Laboratoire Informatique, Image et Interaction) was created in the same year andcentres on the study of digital documents that are produced by humans and meant for humans. 

Antoine Doucet leads the research group on Digital Images and ContentsThe profile of the research group is a good match to lead the project as well as to support the tools of semantic enrichment and participate and operate in a multilingual and potentially noisy and massive setting. This is true as well for its experience in combining semantics with image-based document analysis will be particularly relevant to the article separation task.

Team Biographies

Antoine Doucet

Project Coordinator

Prof. Antoine Doucet is a tenured Full Professor at the L3i laboratory of the University of La Rochelle since 2014. He obtained a PhD in computer science from the University in Helsinki in 2005, and holds a French habilitation (HDR) since 2012.

Antoine's main research interests lie in the fields of information retrieval (structured and semi-structured) and natural language processing. In particular, the central focus of his work is on the development of methods that scale to very large document collections and that do not require prior knowledge of the data (in particular, techniques that function for documents written in any language). Antoine holds a grant for scientific excellence from the French government since 2013 and has obtained several best papers awards (best paper in 2015 for an AIIM journal paper on multilingual event detection, out of 1,272 candidates; and best HCI paper at HCI International 2014). He runs several projects at the University of La Rochelle around the robust semantic analysis of multilingual documents. A. Doucet has been involved in the international workshop on Histoinformatics since its first occurrence in 2013. He has launched and organized several international competitions and benchmarks at the crossroad of statistical natural language processing and document analysis (INEX and CLEF Book Search 2007-2013, ICDAR Book Structure extraction 2009, 2011 and 2013, ICDAR Post-OCR correction 2017).

Dr. Mickaël Coustaty

Dr. Mickaël Coustaty is a tenured Associate Professor at the L3i laboratory of the University of La Rochelle since 2015. He obtained a PhD in computer science in 2011 from University of La Rochelle, France on historical document images analysis and indexing. His main research interests are related to digital humanities, working with history scholars on the one hand, and combining semantic and image processing for fraud detection in administrative documents on the other. He published more than 50 papers and organized various international competitions. Recently, he managed Tourinfluxa large national project on touristic information analysis (4 partners, € 2.3M), and he currently leads a European project (ERA-NET CHIST-ERA SPIRIT) and two national projects (ERDF SECURDOC PC445 and ERDF SeAD PC528) related to fraud detection and content securing (around € 2M in total).