In the conventional CMAC learning scheme, the correcting amounts of errors are equally distributed into all addressed weight, regardless the temporal credibility of those weights. In order to solve the temporal credit...
The application of reinforcement learning is widely used by multi-agent systems in recent years. An agent uses a multi-agent system to cooperate with other agents to accomplish the given task, and one agent′s behavio...