Utility- and Plan-based Action Selection based on Probabilis(3)

来源：网络收集时间：2021-01-20 下载这篇文档手机版

说明：文章内容仅供预览，部分内容可能不全，需要完整文档或者需要复制内容，请下载word后使用。下载word有问题请添加微信号:或QQ：处理（尽可能给您提供完整文档），感谢您的支持与谅解。

Abstract. This paper describes the AGILO RoboCuppers 1 the RoboCup team of the image understanding group (FG BV) at the Technische Universit?t München. With a team of four Pioneer I robots, all equipped with CCD camera and a single board computer, we’ve

tofasterrecovertheirpositionsaftertheyhavelosttrackofthem.Adetaileddescrip-tionoftheselflocalizationalgorithmcanbefoundin[8]andthealgorithmsusedforcooperativemulti-objecttrackingareexplainedin[13,12].

Ourvisionalgorithmscanprocessupto25framespersecond(fps)ona200MHzPentiumPC.Theaveragenumberofimagesprocessedduringamatchisbetween12and17fps.Thisisduetocomputationalresourcesbeingsharedwiththepathplanningandactionselectionmodules.

3.2ExperienceBasedLearningforSituatedActionSelection,PathPlanning

andMovementControl

Anothermajor eldofourresearchactivitiesisautomaticrobotlearningbasedonexperiencesgainedfromexploration.Experiencebasedlearningprovidesapowerfultoolfortheautomaticconstructionofhigh-performanceactionselectionandlow-levelrobotcontrol.Inthisrespectexperiencebasedlearningcaneffectivelycomplementothermethodsfordevelopingsuchcontrollers,inparticularthehandcodingofcon-trollers.Weuselearningfromexperienceinseveralpartsofoursystemsuchaslowlevelrobotcontrol,pathplanningandactionselection.

InlowlevelrobotcontrolwerepresentthestateofaPioneerIrobotasaquintuple

,whereandarecoordinatesinaglobalsystem,istheorienta-tionoftherobotandandarethetranslationalandrotationalvelocities,respectively.

Thelow-levelrobotcontrolleracceptscommandsoftheform.Aneuralnet-workmapsthedesiredstatechangestolowlevelrobotcommands:

Totrainthisnetworkwemeasureahugenumberofstatechangesaccordingtodifferentexecutedlowlevelcommands[6].Doingsoourneuralcontrollerisbasedonnothingbutexperiencenotmakinganyassumptions.

Inorderto ndtheoptimalpathplanningalgorithmforourRoboCuprobotswesta-tisticallyevaluateddifferentmethodsandfoundoutthatthereisnooptimalalgorithmbutanumberofnavigationproblemclasseseachperformedbestwithacertainalgo-rithm/parameterization[6].Theseclassesarede nedwiththehelpofafeaturelan-guage.Inordertoselectthebestmethodforthegivensituationwe’velearnedadecisiontree[11].Thetrainingdataisobtainedfromaccuraterobotsimulationswhereahugenumberofpathplanningproblemswereperformedwithdifferentalgorithmseach.

Theselectionofanappropriateactionisperformedonthebasisofafusedenvi-ronmentalmodel.Asetofpossibleactionssuchasgo2ball,shoot2goal,dribble,block...isde ned.Forallrobotsandeachofthoseactionssuccessrates

[5].Fromallpromisingactions,whichexceedapre-andgainsareestimatedde nedthresholdtheonewiththehighestgainischosentobecarriedout.

3.3Plan-basedActionControl

Whileoursituatedactionselectionaimsatchoosingactionsthathavethehighestex-pectedutilityintherespectivesituationitdoesnottakeintoaccountastrategicassess-mentofthealternativeactionsandtherespectiveintentionsoftheteammates.Thisisthetaskoftheplan-basedactioncontrol.

Inordertorealizeanactionassessmentbasedonstrategicconsiderationandonaconsiderationsoftheintentionsoftheteammates,wedeveloparobotsoccerplaybook,alibraryofplanschematathatspecifyhowtoperformindividualteamplays.Theplans,orbetterplays,aretriggeredbyopportunities,forexample,theopponentteamleaving.

百度搜索“77cn”或“免费范文网”即可找到本站免费阅读全部范文。收藏本站方便下次阅读，免费范文网，提供经典小说教育文库Utility- and Plan-based Action Selection based on Probabilis(3)在线全文阅读。

Utility- and Plan-based Action Selection based on Probabilis(3).doc 将本文的Word文档下载到电脑，方便复制、编辑、收藏和打印下载失败或者文档不完整，请联系客服人员解决！

下载这篇word文档

本文链接：https://www.77cn.com.cn/wenku/jiaoyu/1177589.html（转载请注明文章来源）

上一篇：2010年建设工程项目管理基础知识点111
下一篇：会议室禁烟标志