
Jean-Baptiste Berthelin

CNRS - LIMSI, CHM, Faculty Member
Text mining applies information-extraction algorithms to large collections of natural language text. The DEFT text-mining challenges have been an opportunity to demonstrate the diversity of techniques in this field, and have yielded high-quality corpora of written French text. The paper proposes possible definitions of text mining, along with its particular meaning within DEFT. Each campaign has been organized along definite steps, giving rise to specific problems, among which the adjustment of the rating scales associated with opinion documents. Last, we examine issues in opinion meaning (present in the 2007 and 2009 campaigns) and the question of subjectivity in texts and its processing by statistical or symbolic methods.
From 2005 onward, the French DEFT evaluation campaigns have offered exploratory topics in text mining. The 2007 challenge was about classifying opinion texts, i.e., assigning an opinion class to each text in a corpus. The paper presents an analytic overview of the results obtained by the competitors in the challenge, as well as a synthetic assessment of the methods submitted for evaluation.
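To make the task concrete, here is a minimal sketch of opinion classification, i.e., assigning an opinion class to each text. This is an illustration only, not any competitor's system: it uses a toy multinomial Naive Bayes over unigrams, and the training corpus and labels are invented for the example.

```python
from collections import Counter
import math

# Invented toy corpus: (text, opinion class) pairs.
train = [
    ("excellent film, superb acting", "positive"),
    ("wonderful and moving story", "positive"),
    ("dull plot and terrible acting", "negative"),
    ("boring, a terrible waste of time", "negative"),
]

def tokenize(text):
    # Crude tokenization: strip commas, split on whitespace.
    return text.replace(",", " ").split()

def fit(docs):
    # Count class frequencies and per-class word frequencies.
    class_counts = Counter()
    word_counts = {}
    vocab = set()
    for text, label in docs:
        class_counts[label] += 1
        wc = word_counts.setdefault(label, Counter())
        for w in tokenize(text):
            wc[w] += 1
            vocab.add(w)
    return class_counts, word_counts, vocab

def classify(text, class_counts, word_counts, vocab):
    # Pick the class maximizing log P(class) + sum log P(word | class),
    # with Laplace (add-one) smoothing for unseen words.
    total = sum(class_counts.values())
    best, best_lp = None, float("-inf")
    for label, n in class_counts.items():
        lp = math.log(n / total)
        wc = word_counts[label]
        denom = sum(wc.values()) + len(vocab)
        for w in tokenize(text):
            lp += math.log((wc[w] + 1) / denom)
        if lp > best_lp:
            best, best_lp = label, lp
    return best

model = fit(train)
print(classify("superb and moving film", *model))  # prints "positive"
```

Real DEFT systems were far richer (and the 2007 corpora used multi-level opinion scales rather than a binary one), but the core formulation, mapping each document to one class, is the same.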
The relevance of human judgment in an evaluation campaign is illustrated here through the DEFT text-mining campaigns. In a first step, testing a campaign topic with a limited number of human evaluators informs us about the feasibility of a task. This information comes from the results obtained by the judges, as well as from their personal impressions after taking the test. In a second step, results from individual judges, as well as their pairwise agreement, are used to adjust the task (choice of a rating scale for DEFT-07 and selection of topical categories for DEFT-08). Finally, the mutual comparison of competitors' results at the end of the evaluation campaign confirms the choices made at its starting point, and provides means to redefine the task when a future campaign based on the same topic is launched.
A question answering system will be more convincing if it can give a user elements concerning the reliability of its propositions. In order to address this problem, we chose to take the advice of several searches. First, we search for answers in a reliable document ...