CPC B60W 60/0011 (2020.02) [G06N 3/04 (2013.01)] | 20 Claims |
1. A method for optimizing decision-making regulation and control, comprising:
obtaining a first trajectory sequence that comprises trajectory information of a vehicle in a first environment;
obtaining first target driving behavior information output by a behavior decision-making layer of a decision-making and control system based on information about the first environment;
combining the first trajectory sequence and the first target driving behavior information to obtain a first traveling sequence;
obtaining a second trajectory sequence output by a motion planning layer of the decision-making and control system based on preset second target driving behavior information;
combining the second trajectory sequence and the second target driving behavior information to obtain a second traveling sequence;
optimizing the behavior decision-making layer based on a difference between the first traveling sequence and a preset target teaching traveling sequence, wherein the target teaching traveling sequence comprises a teaching trajectory sequence and teaching driving behavior information; and
optimizing the motion planning layer based on a difference between the second traveling sequence and the target teaching traveling sequence.
|