Overview of the Cross-Domain Authorship Verification Task at PAN 2020

Mike Kestemont, Enrique Manjavacas, Ilia Markov, Janek Bevendorff, Matti Wiegmann, Efstathios Stamatatos, Martin Potthast & Benno Stein
pan@webis.de | https://pan.webis.de
Renewed strategy
• Long-running task on a relevant problem, but:
  • lack of diversity in recent submissions
  • lack of large-scale resources in the field in general
  • lack of task realism (cf. court settings)
• Renewed 3-year strategy, increasing difficulty, scope and realism:
  • Year 1 (2020): increased size
  • Year 2 (2021): increased difficulty
  • Year 3 (2022): "mystery task"
Task
• Authorship verification (and not attribution, obfuscation, …)
• Calibration and test set consist of a series of "problems":
  • given a pair of texts, assign a verification score in [0, 1] (see the sketch below)
  • < 0.5: different-author (DA); > 0.5: same-author (SA)
  • exactly 0.5: non-response (for "difficult" pairs)
• Only unseen test texts, but a "closed" scenario: no new authors
• All pairs are cross-fandom
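The threshold convention above maps directly onto a tiny decision rule; the following is a minimal illustrative sketch, not the official evaluator:

```python
# Minimal sketch of the decision convention described on this slide
# (illustrative only, not the official PAN evaluator).

def decide(score: float) -> str:
    """Map a verification score in [0, 1] to a verdict."""
    if score == 0.5:
        return "non-response"          # reserved for "difficult" pairs
    return "same-author" if score > 0.5 else "different-author"

assert decide(0.73) == "same-author"
assert decide(0.12) == "different-author"
assert decide(0.50) == "non-response"
```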
Benchmark dataset
• Fanfiction dataset (from fanfiction.net): non-professional authors expanding the "canons" of well-known works and authors ("fandoms")
• English-language (but a global phenomenon)
• Huge scale (and no moderation)
• User-provided metadata
• Fandom information as a proxy for "domain"
• Emphasis: fandom information is available to participants (!); see the record sketch below
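To make the pairing of texts with fandom metadata concrete, here is a hypothetical sketch of one problem record in a JSON-lines layout; the field names ("id", "fandoms", "pair", "same") are our assumptions for illustration, not the released schema, so consult the Zenodo record below for the actual format:

```python
# Hypothetical representation of one verification problem; field names are
# illustrative assumptions, not necessarily the released dataset schema.
import json

pair_record = {
    "id": "problem-0001",
    "fandoms": ["Fandom A", "Fandom B"],              # one fandom label per text
    "pair": ["Text of the first fanfic ...", "Text of the second fanfic ..."],
}
truth_record = {"id": "problem-0001", "same": False}  # ground truth kept separately

# One record per line in a *.jsonl file:
line = json.dumps(pair_record)
assert json.loads(line)["fandoms"] == ["Fandom A", "Fandom B"]
```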
Dataset size
(Largest resource in authorship verification that we know of)

                   SA pairs   DA pairs   # fandoms   SA authors   DA authors
Train ("large")    148K       128K       1.6K        41K          250K
Train ("small")    28K        25K        1.6K        25K          48.5K
Test (2020)        10K        6.9K       0.4K        3.5K         12K

Interesting differences: some teams used only a subset of "small", while others enlarged "large" even further.
Evaluation framework
• Varied set of 4 metrics, sensitive to different aspects (c@1 and F0.5u sketched below):
  • AUC: conventional area-under-the-curve score
  • c@1: accuracy variant, rewarding systems that leave difficult problems unanswered
  • F1: classic metric, but not taking non-answers into account
  • F0.5u: new measure, with an emphasis on deciding same-author cases correctly
• Combined score for the final ranking
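For the two less familiar measures, the sketch below follows their published definitions as we understand them (c@1: Peñas & Rodrigo, 2011; F0.5u: Bevendorff et al., 2019); the official PAN evaluation script in the GitHub repository linked on the next slide is authoritative:

```python
# Sketches of c@1 and F0.5u, following their published definitions as we
# understand them; a score of exactly 0.5 counts as a non-answer.

def c_at_1(truth, scores):
    """Accuracy variant that rewards leaving hard problems unanswered."""
    n = len(truth)
    correct = sum(1 for t, s in zip(truth, scores) if s != 0.5 and (s > 0.5) == t)
    unanswered = sum(1 for s in scores if s == 0.5)
    return (correct + unanswered * correct / n) / n

def f_05_u(truth, scores):
    """F0.5-style score that treats non-answers as missed same-author cases."""
    tp = sum(1 for t, s in zip(truth, scores) if t and s > 0.5)
    fp = sum(1 for t, s in zip(truth, scores) if not t and s > 0.5)
    fn = sum(1 for t, s in zip(truth, scores) if t and s < 0.5)
    nu = sum(1 for s in scores if s == 0.5)
    return 1.25 * tp / (1.25 * tp + 0.25 * (fn + nu) + fp)

truth = [True, True, False, False]
scores = [0.9, 0.5, 0.2, 0.6]
print(round(c_at_1(truth, scores), 3), round(f_05_u(truth, scores), 3))  # 0.625 0.5
```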
Two baselines
• Straightforward but competitive
• Calibrated on the "small" set only (to give "large" systems an edge):
  1. cosine similarity between TF-IDF bag-of-words vectors of character 4-grams (with a naive "hack" to shift scores); see the sketch below
  2. text compression method, based on the cross-entropy of "text2" under a Prediction by Partial Matching (PPM) model
• All code is available on GitHub (https://github.com/pan-webis-de/pan-code/tree/master/clef20); all data on Zenodo (https://zenodo.org/record/3716403)
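As a rough approximation of the first baseline, the sketch below computes cosine similarity between TF-IDF vectors of character 4-grams; unlike the official implementation in the GitHub repository above, it is fitted on the pair itself and omits the score-shifting "hack":

```python
# Rough sketch of baseline 1: cosine similarity between TF-IDF vectors of
# character 4-grams. Unlike the official baseline, the vectorizer is fitted
# on the pair itself and no score shifting is applied.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

def verify(text1: str, text2: str) -> float:
    """Return a similarity in [0, 1] for one verification problem."""
    vectorizer = TfidfVectorizer(analyzer="char", ngram_range=(4, 4))
    vectors = vectorizer.fit_transform([text1, text2])
    return float(cosine_similarity(vectors[0], vectors[1])[0, 0])

print(round(verify("She walked slowly into the room, unsure of what to say.",
                   "He had never seen the castle so quiet at this hour."), 3))
```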
Submissions
• 13 submissions from 10 teams
• Novelty: no calibration on TIRA (only testing/deployment), for more flexibility
• 3 teams submitted both "small" and "large" versions
• The others used "small" (or a subset!), apart from ordonez20
• A much more diverse array of methods, including e.g. Siamese networks and the use of fandom information
Results
• Pairwise differences are mostly significant: indicative of diversity
Analysis (1): "small" distributions
• Number heaping, but a strong meta-classifier
• [Last year, the meta-classifier did not outperform the strongest participant…]
Analysis (2): non-answers
• boeninghoff20: surprisingly solid use of non-responses without compromising the score
Analysis (3): topic model
• NMF on TF-IDF | 150 dimensions | top 5K tokens (see the sketch below)
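A sketch of this topic-model setup with the three parameters named on the slide (NMF on TF-IDF, 150 components, vocabulary capped at the top 5K tokens); all other settings are assumptions, and the overview paper remains the authoritative description of the analysis:

```python
# Sketch of the topic-model setup: NMF on TF-IDF features, 150 components,
# vocabulary capped at the top 5K tokens; other settings are assumptions.
from sklearn.decomposition import NMF
from sklearn.feature_extraction.text import TfidfVectorizer

def fit_topic_model(documents, n_topics=150, vocab_size=5000):
    """Fit NMF on TF-IDF features; return document-topic weights and top words."""
    tfidf = TfidfVectorizer(max_features=vocab_size)
    X = tfidf.fit_transform(documents)
    nmf = NMF(n_components=n_topics, random_state=0, max_iter=500)
    doc_topics = nmf.fit_transform(X)          # n_documents x n_topics
    vocab = tfidf.get_feature_names_out()
    top_words = [[vocab[i] for i in comp.argsort()[-10:][::-1]]
                 for comp in nmf.components_]  # 10 strongest tokens per topic
    return doc_topics, top_words

# Usage (assuming `texts` holds one string per test document):
# doc_topics, top_words = fit_topic_model(texts)
```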
Analysis (4): the topic effect is real
• All scores: the "small" meta-classifier on the test set
Conclusions
• Higher diversity in submissions led to an interesting edition
• Reliably established that scale and size matter
• Promising new neural approaches (but brittle, cf. ordonez20)
• Closing in on a solution for in-domain authorship attribution
• Topic-author orthogonality remains the holy grail (next year!)
• Next year:
  • same training data
  • but a much (!) more challenging test dataset
Many thanks to the AV@PAN team, but especially the participants! Check out the task overview paper for more info and see you next year!