Uncertainty-Aware Low-Rank Q-Matrix Estimation for Deep Reinforcement Learning, [...]ing of value function. By quantifying the uncertainty of state-action value estimation, we selectively erase the highly uncertain entries of the state-action value matrix and conduct low-rank matrix reconstruction to recover their values. Such a reconstruction exploits the underl
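The erase-then-reconstruct idea in this fragment can be illustrated with a small sketch. The fragment does not say which reconstruction algorithm the paper uses, so this is only an assumption-laden toy: it handles the rank-1 special case with alternating least squares, and the function name `reconstruct_rank1`, the `threshold` parameter, and the list-of-lists matrix layout are all hypothetical.

```python
def reconstruct_rank1(q, uncertainty, threshold, n_iters=100):
    """Erase entries whose uncertainty exceeds `threshold`, then refill them
    from a rank-1 fit a[i] * b[j] to the entries that were kept.

    Illustrative only: a rank-1 stand-in for low-rank reconstruction,
    not the method of the paper.
    """
    n_rows, n_cols = len(q), len(q[0])
    # keep[i][j] is True when the estimate is trusted and stays untouched.
    keep = [[uncertainty[i][j] <= threshold for j in range(n_cols)]
            for i in range(n_rows)]
    a = [1.0] * n_rows
    b = [1.0] * n_cols
    for _ in range(n_iters):
        # Least-squares update of each row factor with column factors fixed.
        for i in range(n_rows):
            num = sum(q[i][j] * b[j] for j in range(n_cols) if keep[i][j])
            den = sum(b[j] ** 2 for j in range(n_cols) if keep[i][j])
            if den > 0:
                a[i] = num / den
        # Least-squares update of each column factor with row factors fixed.
        for j in range(n_cols):
            num = sum(q[i][j] * a[i] for i in range(n_rows) if keep[i][j])
            den = sum(a[i] ** 2 for i in range(n_rows) if keep[i][j])
            if den > 0:
                b[j] = num / den
    # Trusted entries are returned as-is; erased ones come from the fit.
    return [[q[i][j] if keep[i][j] else a[i] * b[j] for j in range(n_cols)]
            for i in range(n_rows)]


# Toy example: a rank-1 "Q-matrix" with two corrupted, highly uncertain entries.
q_true = [[float((i + 1) * (j + 1)) for j in range(3)] for i in range(4)]
q_noisy = [row[:] for row in q_true]
unc = [[0.0] * 3 for _ in range(4)]
for i, j in [(0, 1), (2, 2)]:
    q_noisy[i][j] = 99.0
    unc[i][j] = 1.0
q_fixed = reconstruct_rank1(q_noisy, unc, threshold=0.5)
```

A higher-rank version would typically replace the two factor vectors with factor matrices or use a truncated SVD; the rank-1 case is used here only to keep the sketch dependency-free.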
[...]rantees when used by SyncBB when elicitations are free. Our model and heuristics thus extend the state of the art in distributed constraint reasoning to better model and solve distributed agent-based applications with user preferences.
FUN-Agent: A HUMAINE Competitor, [...]r, build trust, and thwarting the competition. We present analysis from an in-house comparative evaluation of FUN-Agent against three well-known agent negotiators: Boulware, Conceder, and Linear Conceder.
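For readers unfamiliar with the three baseline negotiators, the sketch below shows the time-dependent concession tactic they are commonly defined by in the negotiation literature. It says nothing about FUN-Agent's own strategy or about the exact settings used in the evaluation; the function name `target_utility`, the exponent values, and the default reservation utility are illustrative assumptions.

```python
def target_utility(t, e, reservation=0.0, k=0.0):
    """Utility an agent demands for itself at normalized time t in [0, 1].

    e < 1  -> Boulware (holds out, concedes only near the deadline)
    e == 1 -> Linear Conceder (concedes at a constant rate)
    e > 1  -> Conceder (concedes quickly at the start)
    `reservation` is the lowest acceptable utility and `k` the initial
    concession; both defaults are assumptions made for this sketch.
    """
    if not 0.0 <= t <= 1.0:
        raise ValueError("t must be normalized to [0, 1]")
    concession = k + (1.0 - k) * t ** (1.0 / e)
    return reservation + (1.0 - reservation) * (1.0 - concession)


# Demanded utility halfway through the negotiation (exponents are hypothetical).
halfway = {
    "Boulware": target_utility(0.5, e=0.2),
    "Linear Conceder": target_utility(0.5, e=1.0),
    "Conceder": target_utility(0.5, e=2.0),
}
```

Halfway through, the Boulware agent still demands almost its full utility while the Conceder has already given up most of it, which is why the three are useful as contrasting baselines.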
BGC: Multi-agent Group Belief with Graph Clustering, [...] unique characteristics of each agent. Besides, to overcome the consistent agent problem of NCC, a split loss is introduced to distinguish different agents and reduce the number of groups. Results reveal that the proposed method achieves excellent coordination and a significant improvement in
MARL for Traffic Signal Control in Scenarios with Different Intersection Importance, [...]rsections. Specifically, the leader-follower paradigm controls intersections in a traffic scenario with two kinds of agents, i.e., a leader agent controlling intersections that need special attention, and follower agents controlling ordinary intersections. Then a multi-agent reinforcement learning framewor
Uncertainty-Aware Low-Rank Q-Matrix Estimation for Deep Reinforcement Learning, [...]ent fields, the underlying structure and learning dynamics of the value function, especially with complex function approximation, are not fully understood. In this paper, we report that a decreasing rank of the Q-matrix widely exists during the learning process across a series of continuous control tasks for diff
BGC: Multi-agent Group Belief with Graph Clustering, [...]sks. Most current methods assume that agents can communicate to assist decisions, which is impractical in some real situations. In this paper, we propose an observation-to-cognition method to enable agents to achieve highly efficient coordination without communication. Inspired by the neighborhood cog
Securities Based Decision Markets, [...]on scoring rules have been proven to offer incentive compatibility analogous to properly incentivised prediction markets. However, in contrast to prediction markets, it is unclear how to implement decision markets such that forecasting is done through the trading of securities. We here describe such
The Positive Effect of User Faults over Agent Perception in Collaborative Settings and Its Use in A[...], [...] Through a series of extensive experiments, we find that user faults make the user more tolerant of agent faults and consequently more satisfied with the collaboration, in particular compared to the case where the user performs faultlessly. This finding can be utilized for improving the design
Behavioral Stable Marriage Problems, [...]dent doctors to hospitals and students to schools. Several preference models have been considered in the context of SMPs, including orders with ties, incomplete orders, and orders with uncertainty, but none have yet captured behavioral aspects of human decision making, e.g., contextual effects of cho
FUN-Agent: A HUMAINE Competitor, [...]s have been pushing the frontier of new modalities of peer-level and ad hoc human-agent collaboration. We are particularly interested in research on agents representing human users in negotiating deals with other human and autonomous agents. Here we present the design for the conver
Combining M-MCTS and Deep Reinforcement Learning for General Game Playing, [...]ed on game rules without human intervention. Most recent work has successfully applied deep reinforcement learning to GGP. This paper continues this line of work by integrating the Memory-Augmented Monte Carlo Tree Search algorithm (M-MCTS) with deep reinforcement learning for General Game Playing.
A Two-Step Method for Dynamics of Abstract Argumentation, [...]tion framework by the intersection of expansions of the conflict-free labellings of argumentation frameworks before updating. Then we select the conflict-free labellings that have the fewest illegally labelled arguments when restricted to part of both argumentation frameworks in the update process.
[...]y selectively revealing information to players in order to influence their actions. Most previous studies have focused on the [...] question of designing optimal signaling schemes. This work departs from previous research by considering a [...] question, and looks to quantitatively characterize the [...] ([...]), i
[...] A key assumption in this model is that all constraints are fully specified or known a priori, which may not hold in applications where constraints encode preferences of human users. In this paper, we extend the model to Incomplete DCOPs (I-DCOPs), where some constraints can be partially specified. User preference
MARL for Traffic Signal Control in Scenarios with Different Intersection Importance, [...]ress. However, those methods assume that all agents in the cooperative games are isomorphic, which ignores the fact that different agents can play heterogeneous roles in the ATSC scenario. The tolerance of vehicles at different intersections in the same area differs, e.g., traffic congestion
[...] centralized learning with decentralized execution framework, their decentralized execution paradigm limits the agents’ capability to coordinate. Inspired by the concept of correlated equilibrium, we propose to introduce a [...] to address this limitation, and theoretically show that following mild cond
Distributed Artificial Intelligence. ISBN 978-3-030-94662-3. Series ISSN 0302-9743. Series E-ISSN 1611-3349.
https://doi.org/10.1007/978-3-030-94662-3 Keywords: artificial intelligence; autonomous agents; computer networks; computer science; computer systems; educat
ISBN 978-3-030-94661-6. Springer Nature Switzerland AG 2022.
Safe Distributional Reinforcement Learning, [...]tributional RL perspective leads to a more efficient algorithm while additionally catering for natural safety constraints. We empirically validate our propositions against appropriate state-of-the-art safe RL algorithms.
The Positive Effect of User Faults over Agent Perception in Collaborative Settings and Its Use in A[...], [...]of collaborative agents. In particular, we present a proof of concept for such an augmented design, in which the agent, whenever it is in charge of allocating the tasks or can pick its own tasks, deliberately leaves the user with a relatively difficult task to increase the chance of a user fault, which in turn increases user satisfaction.
Behavioral Stable Marriage Problems, [...] analyze the computational complexity of BSMPs and show that proposal-based approaches are affected by contextual effects. We then propose and evaluate novel ILP and local-search-based methods to efficiently find optimally stable and fair matchings for BSMPs.
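As background for the phrase "proposal-based approaches", here is a minimal sketch of the classic Gale-Shapley deferred-acceptance procedure for the ordinary stable marriage problem. It assumes strict, complete preference lists and equal-sized sides, and it does not model the behavioral or contextual effects that BSMPs add; the function name and the toy preference data are hypothetical.

```python
def gale_shapley(proposer_prefs, receiver_prefs):
    """Return a stable matching {proposer: receiver}.

    Both arguments map each agent to a list of all agents on the other
    side, most preferred first (strict and complete, by assumption).
    """
    # rank[r][p] is the position of proposer p in receiver r's list.
    rank = {r: {p: i for i, p in enumerate(prefs)}
            for r, prefs in receiver_prefs.items()}
    next_choice = {p: 0 for p in proposer_prefs}  # next index to propose to
    engaged_to = {}                               # receiver -> proposer
    free = list(proposer_prefs)
    while free:
        p = free.pop()
        r = proposer_prefs[p][next_choice[p]]
        next_choice[p] += 1
        current = engaged_to.get(r)
        if current is None:
            engaged_to[r] = p          # r was unmatched and accepts
        elif rank[r][p] < rank[r][current]:
            engaged_to[r] = p          # r trades up; old partner is free again
            free.append(current)
        else:
            free.append(p)             # r rejects p, who proposes again later
    return {p: r for r, p in engaged_to.items()}


# Toy instance: both proposers prefer x, but x prefers b.
matching = gale_shapley(
    {"a": ["x", "y"], "b": ["x", "y"]},
    {"x": ["b", "a"], "y": ["a", "b"]},
)
```

In this toy instance `b` ends up with `x` and `a` with `y`. Each receiver decides purely by comparing fixed ranks, which is the step a model with context-dependent preferences would have to revisit.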
Conference proceedings 2022, [...] in Shanghai, China, in December 2021. The 15 full papers presented in this book were carefully reviewed and selected from 31 submissions. DAI aims at bringing together international researchers and practitioners in related areas including general AI, multiagent systems, distributed learning, computational game theory, etc., to provide a single, high-profile, internationally renowned forum for research in the theory and practice of distributed AI.
A Two-Step Method for Dynamics of Abstract Argumentation, [...] Finally, we prove the soundness and completeness of our method under the complete semantics. In other words, the complete labellings of the resulting argumentation framework after the update are the same as those produced by our method.