Playing Angry Birds with a Domain-Independent PDDL+ Planner

Page created by Diane Romero

World Around

English

Like
Share
Embed
Fullscreen
Slides
Download HTML
Download PDF
Abuse

←

→

Page content transcription

If your browser does not render page correctly, please read the page content below

Playing Angry Birds with a Domain-Independent PDDL+ Planner
                                         Wiktor Piotrowski,1 Roni Stern,1,2 Matthew Klenk,1 Alexandre Perez,1 Shiwali Mohan1 Johan de
                                                                               Kleer,1 Jacob Le,,1
                                                                                     1
                                                                                       Palo Alto Research Center, CA, USA
                                                                             2
                                                                            Ben-Gurion University of the Negev, Beer-Sheva, Israel
                                                            e-mail: {wiktorpi,rstern,aperez,smohan,dekleer,jale}@parc.com;klenk.matt@gmail.com
arXiv:2107.04635v1 [cs.AI] 9 Jul 2021

                                                                   Abstract                               We describe the first successful application of domain-
                                                                                                          independent planning to solve Angry Birds, and demonstrate
                                          This demo paper presents the first system for playing the       that its performance is on par with existing domain-specific
                                          popular Angry Birds game using a domain-independent plan-       approaches.
                                          ner. Our system models Angry Birds levels using PDDL+, a
                                          planning language for mixed discrete/continuous domains. It
                                          uses a domain-independent PDDL+ planner to generate plans        Solving Angry Birds with a PDDL+ Planner
                                          and executes them. In this demo paper, we present the sys-      Our agent, called Hydra, models Angry Birds levels using
                                          tem’s PDDL+ model for this domain, identify key design          PDDL+ (Fox and Long 2006). PDDL+ is a planning lan-
                                          decisions that reduce the problem complexity, and compare       guage that enables modeling problems exhibiting both dis-
                                          the performance of our system to model-specific methods
                                                                                                          crete mode switches and continuous flows. Hydra accepts
                                          for this domain. The results show that our system’s perfor-
                                          mance is on par with other domain-specific systems for Angry    an AngryBirds level from the Science Birds API (Renz et al.
                                          Birds, suggesting the applicability of domain-independent       2019), in the form of a list of labeled objects and their loca-
                                          planning to this benchmark AI challenge. A video explain-       tions, and translates it to a PDDL+ problem. Then, it solves
                                          ing and demonstrating the behavior of our system is available   this problem using UPMurphi (Della Penna et al. 2009), a
                                          at https://bit.ly/35065UZ.                                      well-known domain-independent PDDL+ planner.
                                                                                                             The objects in our PDDL+ model of Angry Birds are sep-
                                                                                                          arated into four types: birds, pigs, blocks, and platforms. Ob-
                                                     Background: Angry Birds                              jects’ properties such as locations, width, height, and veloci-
                                        Angry Birds is a wildly popular mobile game in which the          ties are modeled as functions. The slingshot, which is where
                                        objective is to destroy pigs by launching birds at them from      the birds are launched from, is not modeled explicitly as an
                                        a slingshot. The pigs may be protected by structures built        object in the domain. Instead, we assigned the slingshot’s co-
                                        from blocks of various types. Each level of the game has          ordinates to every bird before it is launched. Avoiding mod-
                                        an allotted number of birds that can be used, and pigs that       eling unnecessary objects simplifies planning. The PDDL+
                                        need to be killed in order to solve the level. Players earn       domain of Angry Birds features a variety of dynamics which
                                        points by destroying pigs and blocks as well as for each          dictate the change and evolution of the system. We mod-
                                        unused bird after the level has been solved. This game is         eled these dynamics through PDDL+ actions, events, and
                                        an interesting testbed for automated planning research since      processes constructs. PDDL+ events represent the system’s
                                        it requires reasoning about sequential actions with complex       mode switches which instantaneously change the dynamics
                                        dynamics (Renz et al. 2019). Unlike classical planning, the       of the modeled system, whereas PDDL+ processes represent
                                        agent’s actions often triggers a cascading chain of events        processes that evolve the system over time, dictated by a set
                                        causing drastic changes to the world. The Angry Birds             of ordinary differential equations.
                                        game is inherently a non-trivial hybrid system, exhibiting        Actions Angry Birds features two types of actions: (1)
                                        both discrete and continuous behavior, which includes non-        launching a bird from the slingshot at a chosen angle, and (2)
                                        linear dynamics. Game levels often contain a large num-           activating a bird’s special power (if available). Currently, we
                                        ber of objects, which vary in their type and functionality.       only consider the basic behavior of birds without their spe-
                                        From a computational complexity perspective, different ver-       cial powers. Launching of the bird consists of choosing an
                                        sions of the game have proven to be NP-Hard, PSPACE-              angle, adjusting the launch velocity, and releasing the bird.
                                        complete, and EXPTIME-hard (Stephenson, Renz, and Ge              To mitigate the complexity of the launch action we fix the
                                        2020), Most existing AI agents for this game are based            launch velocity to its maximum possible value, removing the
                                        on encoding domain-specific strategies and selecting from         need for the agent deliberating over the power adjustment.
                                        them (Borovicka, Spetlik, and Rymes 2014; Wang 2017).             This decision is motivated by the fact that maximum veloc-
                                                                                                          ity shots are the most common as they provide widest range
                                        Copyright © 2021, Association for the Advancement of Artificial   of targets and impact velocity is proportional to damage. In
                                        Intelligence (www.aaai.org). All rights reserved.                 addition, we paired the release action with a supporting pro-

Agent Problems Solved Avg. Score per Level pre-defined strategies (e.g., destroy pigs, target TNT, target
ANU 49 45,112 round blocks, destroy structures) and choosing the strategy
Hydra 44 47,516 that will yield the maximal predicted utility. Eagle’s Wing
DataLab 26 46,668 also uses a pre-trained XGBoost model to optimize its per-
Eagle’s Wing 16 39,152
formance. All agents are designed to work with the Science-
Birds API. For the evaluation, each agent attempted to solve
Table 1: Levels solved and avg. scores given a 60 sec. time
a set of 100 randomly generated Angry Birds levels, which
limit.
can be found at gitlab.com/aibirds/sciencebirdsframework.
cess that continuously adjusts the angle as soon as a new We compared the results of all agents to a baseline perfor-
bird is placed on the slingshot and is ready to be launched. mance published by ANU, which was computed by aver-
Thus, to launch a bird the agent only needs to choose when aging the scores of DataLab, Eagle’s Wing, and an ANU-
to release it instead of at which angle to do so. created naive agent. The latter targets a randomly selected
Events The vast majority of the Angry Birds system me- pig with each active bird, and has predefined windows for
chanics are event-driven. Each object in the game has one or apply special actions per bird type. It only considers the lo-
more associated events which govern their interactions with cations of visible pigs, disregarding all other objects in the
the environment and other objects in the level. They enable scene.
modeling of complex behavior including all collisions, de- Table 1 shows the number of problem solved (out of
structions, explosions, and structure collapses. Events also 100) and the average score per level for each agent and
model supporting features such as loading the next bird into the published baseline results (labelled “ANU”). As can be
the slingshot or exploding the currently active bird. Model- seen, Hydra’s performance is on par with existing domain-
ing the motion of blocks after impact and chains of block- dependent approaches, even without encoding the birds’
block collisions, requires defining a process to track each special powers. It solved more problems than DataLab and
individual block’s change in position and rotation over time, Eagle’s Wing (44 vs. 26 and 16) and obtained the highest
as well as a set of events to account for secondary interac- average score (47,516) per level. It is worth noting that both
tions. Keeping track of dozens of such non-linear processes Eagle’s Wing and DataLab’s were designed for the Science-
in every state after a collision would significantly slow down Birds competition, where the levels were hand-crafted with
the planner, while the improvement in the domain’s accu- specific winning strategies. In contrast, the levels we used
racy is expected to be small. Therefore, we currently avoid were created by ANU’s automated level generator.
modelling falling blocks processes and block-block collision
events. Conclusion
Processes There are two processes in our PDDL+ model: in- We presented an agent that successfully plays the Angry
crease angle and flying. The increase angle process is a lin- Birds game using a domain-independent planner. Our agent
ear increase of the active bird release angle prior to launch- translates Angry Birds levels to PDDL+ problems and solves
ing it, as described above. This process is triggered as soon them using a PDDL+ solver. This demonstrates the expres-
as the next bird in the queue is available for launch. The fly- sive power of PDDL+ and corresponding solvers. We plan
ing process models the flight of the active bird, updating its on further extending our domain to improve its accuracy
velocity and location over time, according to the governing and develop domain-independent heuristics for more effi-
non-linear equations of motion. This process is triggered af- cient PDDL+ planning. In addition, we are exploring tech-
ter a bird is released from the sling. niques to diagnose incorrect PDDL+ models and repair them
Simplified Modeling While the goal of Angry Birds is to automatically (Klenk et al. 2020).
destroy all of the pigs, finding a solution to the correspond-
ing PDDL+ problem was too difficult for our PDDL+ plan- References
ner in some cases. Therefore, Hydra also generates two sim- Borovicka, T.; Spetlik, R.; and Rymes, K. 2014. DataLab Birds
plified PDDL+ problems: (1) a single-shot problem, which Angry Birds AI. Technical report, Czech Technical University.
splits the original scenario into single-bird episodes with the
goal of killing at least one pig, and (2) a single-shot-no- Della Penna, G.; Magazzeni, D.; Mercorio, F.; and Intrigila, B.
2009. UPMurphi: a tool for universal planning on PDDL+ prob-
blocks problem, which only considers platforms and pigs. lems. In International Conference on Automated Planning and
In our current implementation, we first attempt to generate Scheduling.
plans for the single-shot problem. If no plan is found in 30s,
we halt the planner and attempt to generate plans for the Fox, M.; and Long, D. 2006. Modelling mixed discrete-continuous
domains for planning. Journal of Artificial Intelligence Research
single-shot-no-blocks problem. If again, no plan is found in 27: 235–297.
30s, we execute a pre-defined default action.
Klenk, M.; Piotrowski, W.; Stern, R.; Mohan, S.; and de Kleer, J.
Experiments 2020. Model-Based Novelty Adaptation for Open-World AI. In
We evaluated Hydra against two state-of-the-art agents: International Workshop on Principles of Diagnosis (DX).
DataLab (Borovicka, Spetlik, and Rymes 2014), and Ea- Renz, J.; Ge, X.; Stephenson, M.; and Zhang, P. 2019. Ai meets
gle’s Wing (Wang 2017). DataLab won the 2014 & 2015 angry birds. Nature Machine Intelligence 1(7): 328–328.
ScienceBirds competition and Eagle’s Wing won the 2016- Stephenson, M.; Renz, J.; and Ge, X. 2020. The computational
2018 competitions. Both agents work by trying a set of complexity of Angry Birds. Artificial Intelligence 280: 103232.

Wang, T. J. 2017. Description for Eagle’s Wing, an AI Angry
Bird Playing Agent. Technical report, University of Waterloo.
Http://www.aibirds.org/2017/eaglewing2017.pdf.

This figure "hydra-results-sb-cropped.png" is available in "png" format from:

        http://arxiv.org/ps/2107.04635v1

This figure "hydra-results-sb.png" is available in "png" format from:

        http://arxiv.org/ps/2107.04635v1

You can also read

Career Pathways and Employment Assistance

Anger - Off The Record Bath

HOW CAN YOU BRING YOUR VISION ZERO TO LIFE?

Avian Pox in Garden Birds

THE L WORD: GENERATION Q CARIBBEAN RESORT VACATION - MAY 1-8, 2021 Club Med's ALL-INCLUSIVE

SYSTEMS LANDSCAPE FOR OPTICAL INTERCONNECT - Mike Woodacre HPE Fellow CTO for High-Performance Computing and Mission Critical Systems ...

TII Statement of Strategy 2018 to 2022 - March 2018 - Transport Infrastructure Ireland

CONTRIBUTION OF THERMAL-HYDRAULICS SIMULATION TO CRITICALITY ANALYSIS OF A DEBRIS BED NUMERICAL MOCK-UP

Whole Tablet Measurements Using the Spectrum One NTS Tablet Autosampler System

Heathrow and Richmond Park

BUILDING / UPGRADING A SECONDARY SUITE - District of ...

Augmented Reality Approach to Domestic Maintenance Tasks

MI6 USER'S MANUAL - SIX CHANNEL ALARM AND SHUTDOWN SYSTEM - Axiom Technologies, LLC

World Fair Trade Organization Constitution 2021

AUTONOMOUS DRONE-IN-A-BOX SOLUTIONS - Surveillance, Monitoring and Inspections - Easy Aerial

Improving Health and Wellbeing in Cheshire and Merseyside - Strategy 2021-2025 - Cheshire & Merseyside Health & Care ...

RADIANT SYSTEMS OFFER VERSATILITY AND ENERGY SAVINGS - 2021 SPRING EDITION - Rehau

MORNING GRAIN COMMENTARY - Leese Group Trading

VOICE.VIDEO.BOT TIME TO UPGRADE TO HALOOCOM

PRODUCT PORTFOLIO 2018 - STARKEYPRO

PRICELIST - Feel the difference - Robeta

ST. CATHARINES, ON. June 8-10 "

ULTRAVAC+ VACUUM/DRYING CHAMBER TRAINING MANUAL - HEARING INSTRUMENT RESTORATION SYSTEM

An information system for historical Cadastre of Venice

The RTE-3 Extended Task - Hoa Dang Ellen Voorhees

Video Measuring Microscopes - for precision 3-axis measurement

Academy of Hospitality and Tourism-Highlights

PSI Corporate Strategy 2021 2023 - Pharmaceutical ...

SECURE QR BASED POSTAL DELIVERY SYSTEM USING GOGLE FIREBASE APP MANAGEMENT MODEL - IJCRT

Advanced method of managing soil conservation works in Smart Farms

Analyzing Complex Predicates: Syntactic Evidences from Sampark and Google Machine Translation Systems

HEALTHCARE NEWS - Wooltru

St Mary's RC Primary School Art and Design Policy - Approved: Review date: Sept 2018 - St Marys ...

DIMENSIONS ANALYTICS - THE BASICS - Dimensions Analytics (April 2021)

THE EFFECT OF DIGITAL TECHNOLOGY ON THE RAIL CUSTOMER EXPERIENCE - APTA EMERGING LEADERS CAPSTONE PROJECT CLASS OF 2019 GROUP 2

System Status Briefing - Jan Oberholzer Chief Operating Officer January 2020 - Kwanalu

Prinect Color Toolbox 2021 - Installation. Equipment

UCI Road World Championships 2013 - CMWE

BUSINESS TRANSFORMATION IN OPERATION(S) - ADM MATURITY CHECK - Frank Luyckx & René Overwater, februari 2015, Den Bosch VNSG Focusgroep Advanced ...

Building the foundations of a Restoration Economy - raising investment for habitat banks - Prof David Hill CBE Chairman, Environment Bank

MANUAL - MANUAL COVERS LIONCORE MODELS, 55111 - 551117 VPWLLC.COM - VERTICAL PARTNERS WEST

Global Information Assurance Certification Paper

Challenges in Fault-Tolerance for Peta-ExaScale systems and research opportunities - Franck Cappello

VERIZON CALL CENTER Investment Opportunity

NOISE ANALYSIS OF NIKON D40 DIGITAL STILL CAMERA

NEBRASKA AUDITOR OF PUBLIC ACCOUNTS - Charlie Janssen State Auditor

Freestanding Pellet Stove with Water Jacket 2-12 kW

LOMONACO's IRON CONCEPTs - ALUMINUM & SECURITY STORM DOORS 2021 - Lomonaco's Iron Concepts

ARGENTINA'S COUNTERCYCLICAL CREDIT POLICY RESPONSE: MACROPRUDENTIAL REGULATION AND PUBLIC BANK CREDIT DURING COVID-19

Enterprises Must Adapt to Address Telework Security Challenges - 2020 Remote Workforce Cybersecurity Report - Fortinet