Ideas for research from social bots to beer - Thesis proposals - Florian Daniel


Thesis proposals
Ideas for research from social bots to beer

Florian Daniel
florian.daniel@polimi.it
February 25, 2019

Detection of harmful social bots

Identification of harmful communication patterns

Conversational screen readers

Domain-specific content extraction

(Social) bot = algorithmically driven entity that behaves like a human in online communications

Empirical study shows: bots may cause harm to humans
[Figure: diagram linking what bots do (actions) to example cases and to the types of abuse that can result. The individual links and the counts shown in the figure are not recoverable from the transcription.]

What bots do (actions):
  Chat: talk with user, redirect user
  Post: write post, comment post, forward post
  Endorse: like message, follow user
  Participate: create user
  What else can they do?

When abuse happens (types of abuse, in the order shown; the figure marks the upper part of the list as regulated by law and the lower part as not regulated):
  Disclose sensitive facts
  Denigrate
  Be grossly offensive
  Be indecent or obscene
  Be threatening
  Make false allegations
  Deceive
  Spam
  Spread misinformation
  Mimic interest
  Clone profile
  Invade space
  What else can go wrong?

Examples named in the figure: eCommerce, CustomerSvc, AASlang, BoostJuice, SMSsex, Puma, Oreo, MS Tay, DeathThreat, Geico, Dating, SethRich, Ivana, Spam, PolarBot, WiseShibe, Instagress, Trump, Colluding Bots, Jason Slotkin, InstaClone

F. Daniel, C. Cappiello, B. Benatallah. Bots Acting Like Humans: Understanding and Preventing Harm. IEEE Internet Computing,
2019, accepted for publication. https://ieeexplore.ieee.org/document/8611348

Detection of harmful social bots

How do we identify and classify bots according to the harm they may cause?

[Figure: an account and its tweets are classified either as Human or into one of the harm classes Harm1, Harm2, ..., HarmN]

What has been done so far?

Not Safe For Work

Profile picture stolen from existing people (usually women)
Personalized profile preferences aimed at emulating actual users
Catchy profile description paired with unsafe URL
Tweets aim to redirect users to untrusted URLs

Lorenzo Cannone, Matteo Di Pierro. Detection and Classification of Harmful Bots in Human-Bot Interactions on Twitter.
MSc Thesis, Politecnico di Milano, December 2018.

News-Spreader

Propagandistic profile picture and tile
Propaganda hashtags and messages in description
High number of interactions
Lots of (political) news retweeting


Spam-Bot

Profile picture with corporate logo
Profile tile with catchy messages
Corporate link in description paired with messages of job offers (product sales)
High number of tweets with high frequency
Repetitive tweets aimed at spamming URLs


Fake-Follower

Optional profile picture (usually missing)
Poor profile customization
Poor description (usually empty)
Low number of content posting (usually null)
Following as main interaction
Optional (re)tweeting activity


Thesis ingredients

Creation of datasets
Feature selection and engineering
Multi-class ensemble classifier
Web app

Can we add more types of harm?
How to train classes as users use BotBuster?

[Screenshot residue from the thesis: caption "Figure 7.3: Client - Server architecture" and the text fragment "Inception neural network. Users with less than 10 tweets, or less than 10", which is cut off in the source]
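
To make the "multi-class ensemble classifier" ingredient more concrete, here is a minimal sketch of majority voting over several weak classifiers. It is not the classifier from the thesis: the feature names, thresholds and hand-written rules are all hypothetical stand-ins for trained models, loosely inspired by the four profile types above.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass
class Account:
    """Hypothetical per-account features (names are illustrative only)."""
    has_profile_picture: bool
    description: str
    tweet_count: int
    following_count: int
    url_tweet_ratio: float  # share of tweets that contain a URL
    retweet_ratio: float    # share of tweets that are retweets


# Each voter is a weak, hand-written rule returning one class label.
# In a real system these would be trained models; thresholds are assumptions.
def vote_by_volume(a: Account) -> str:
    if a.tweet_count == 0:
        return "fake-follower"
    if a.tweet_count > 1000 and a.url_tweet_ratio > 0.8:
        return "spam-bot"
    return "human"


def vote_by_profile(a: Account) -> str:
    if not a.has_profile_picture and not a.description.strip():
        return "fake-follower"
    return "human"


def vote_by_sharing(a: Account) -> str:
    if a.retweet_ratio > 0.9:
        return "news-spreader"
    if a.url_tweet_ratio > 0.8:
        return "spam-bot"
    if a.tweet_count < 5 and a.following_count > 500:
        return "fake-follower"
    return "human"


VOTERS = [vote_by_volume, vote_by_profile, vote_by_sharing]


def classify(a: Account) -> str:
    """Return the label with the most votes (ties go to the label voted first)."""
    votes = Counter(voter(a) for voter in VOTERS)
    label, _count = votes.most_common(1)[0]
    return label


if __name__ == "__main__":
    silent = Account(False, "", 0, 2000, 0.0, 0.0)
    print(classify(silent))
```

Majority voting is only one way to combine classifiers; it is used here because it needs no training data to illustrate the idea.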

Identification of harmful communication patterns

Bot code repositories: GitHub, Twitter, Python

Which potential harms can we identify inside the code of the bots?

Harmful code patterns

[Table: code patterns (rows) against types of abuse (columns). The rotated column headers are garbled in the transcription; they appear to be the abuse types: Disclose sensitive facts, Denigrate, Be grossly offensive, Be indecent or obscene, Be threatening, Make false allegations, Deceive, Spam, Spread misinformation, Mimic interest, Clone profile, Invade space. The cell values are not recoverable. Legend: Enables, Prevents, Vulnerable to content abuse, Vulnerable to trust abuse.]

Action     Pattern
Follow     Indiscriminate follow
           Whitelist-based follow
           Blacklist-based follow
           Phantom follow
Like       Indiscriminate like
           Whitelist-based like
           Blacklist-based like
           Mass like
Tweet      Fixed-content tweet
           AI-generated tweet
           Trusted source tweet
Mention    Indiscriminate mention
           Opt-in mention
           Targeted mention
           Whitelist-based mention
           Blacklist-based mention
Retweet    Indiscriminate retweet
           Whitelist-based retweet
           Blacklist-based retweet
           Mass retweet
Talk to    Indiscriminate talk
           Fixed-content talk
           AI-generated talk
           Talk with opt-in
           Targeted talk
Pause      Mimic human
           Satisfy API constraints
Store      Store persistently

Next: patterns search engine + web site for users

What else can we learn from the code of bots? Is it possible to trace back from messages to code?

Andrea Millimaggi and Florian Daniel. On Social Bots Behaving Badly: Empirical Study of Code Patterns on GitHub. Submitted for publication to ICWE 2019, under review.
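
As an illustration of what three of the follow patterns listed above might look like in bot code, here is a small sketch. The slides do not define the patterns, so the interpretation is an assumption: "indiscriminate" follows every candidate, "whitelist-based" follows only listed users, and "blacklist-based" follows everyone except listed users. The function names are hypothetical and no real Twitter API is called.

```python
from typing import Iterable, List


def indiscriminate_follow(candidates: Iterable[str]) -> List[str]:
    """Follow every candidate account, without any filtering."""
    return list(candidates)


def whitelist_based_follow(candidates: Iterable[str],
                           whitelist: Iterable[str]) -> List[str]:
    """Follow only candidates that appear on an explicit allow list."""
    allowed = set(whitelist)
    return [user for user in candidates if user in allowed]


def blacklist_based_follow(candidates: Iterable[str],
                           blacklist: Iterable[str]) -> List[str]:
    """Follow every candidate except those on an explicit deny list."""
    blocked = set(blacklist)
    return [user for user in candidates if user not in blocked]


if __name__ == "__main__":
    users = ["alice", "bob", "carol"]
    print(indiscriminate_follow(users))
    print(whitelist_based_follow(users, ["bob"]))
    print(blacklist_based_follow(users, ["bob"]))
```

Under this reading, the patterns differ only in the filter applied before the action, which is the kind of structural difference a code search could look for.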

Conversational screen readers

Amazon Alexa
Google Assistant

Can we extract conversational knowledge from webpages?
What about “talking” to websites?

Domain-specific content extraction

How effectively can we extract recipes from free text? And how do we do it?
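
One very simple baseline for the question above is pattern matching on "quantity unit ingredient" phrases. The sketch below assumes metric units and English text; the regular expression, the unit list and the sample sentence are all hypothetical, and a real extractor would need far more robust handling of free text.

```python
import re
from typing import List, Tuple

# Matches e.g. "4.5 kg of pale malt" or "30 g Cascade hops".
# The ingredient name ends at punctuation, at " and ", or at end of text.
INGREDIENT = re.compile(
    r"(?P<qty>\d+(?:\.\d+)?)\s*(?P<unit>kg|g|ml|l)\s+(?:of\s+)?"
    r"(?P<name>[A-Za-z][A-Za-z -]*?)(?=[,.;]|\s+and\s+|$)",
    re.IGNORECASE,
)


def extract_ingredients(text: str) -> List[Tuple[float, str, str]]:
    """Return (quantity, unit, name) triples found in free text."""
    return [
        (float(m["qty"]), m["unit"].lower(), m["name"].strip())
        for m in INGREDIENT.finditer(text)
    ]


if __name__ == "__main__":
    sample = ("Mash 4.5 kg of pale malt, then boil for 60 minutes "
              "with 30 g Cascade hops and add 10 g yeast.")
    for triple in extract_ingredients(sample):
        print(triple)
```

Durations such as "60 minutes" are skipped because "minutes" is not in the unit list; this also shows how brittle a purely rule-based approach is.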

Typical data science steps

Domain understanding
Data collection
Manual inspection of data
Hypotheses formulation
Feature engineering, data labeling
Algorithm engineering (from AI/machine learning to statistics)
Validation and hypotheses verification

Online tool

Soft skills

Proposals

Template

Florian Daniel
http://www.floriandaniel.it
