关注 过往记忆大数据 微信公众号，回复 4244 获取本书下载地址。
his book is devoted to the encounter and interaction of agents such as robots with other agents and describes how they cooperate with their previously unknown teammates, forming an Ad Hoc team. It presents a new algorithm, PLASTIC, that allows agents to quickly adapt to new teammates by reusing knowledge learned from previous teammates. PLASTIC is instantiated in both a model-based approach, PLASTIC-Model and a policy-based approach, PLASTIC-Policy. In addition to reusing knowledge learned from previous teammates, PLASTIC also allows users to provide expert-knowledge and can use transfer learning (such as the new Two Stage Transfer algorithm) to quickly create models of new teammates when it has some information about its new teammates. The effectiveness of the algorithm is demonstrated on three domains, ranging from multi-armed bandits to simulated robot soccer games.
关注 过往记忆大数据 微信公众号，回复 4244 获取本书下载地址。如图书无法下载，请加微信 fangzhen0219 反馈。