11493926

Offline Agent Using Reinforcement Learning to Speedup Trajectory Planning for Autonomous Vehicles

PublishedNovember 8, 2022
Assigneenot available in USPTO data we have
Technical Abstract

Patent Claims
18 claims

Legal claims defining the scope of protection, as filed with the USPTO.

2

2. The method of claim 1, wherein the plurality of discretized control action options are generated based on a vehicle dynamic model for autonomous driving.

3

3. The method of claim 1, wherein the plurality of discretized trajectory state options are generated by discretizing a region of interest for the driving scenario in view of a final destination trajectory state.

4

4. The method of claim 1, wherein the judgment score includes scores representing whether the trajectory ends at a planned destination state, the trajectory is smooth, and the trajectory avoids one or more obstacles of an environment model.

5

5. The method of claim 1, wherein the driving scenario includes one or more regions of interest (ROIs).

6

6. The method of claim 1, wherein the RL agent includes an actor neural network and a critic neural network, and wherein the actor and critic neural networks are deep neural networks.

7

7. The method of claim 6, wherein the actor neural network includes a convolutional neural network.

9

9. The non-transitory machine-readable medium of claim 8, wherein the plurality of discretized control action options are generated based on a vehicle dynamic model for autonomous driving.

10

10. The non-transitory machine-readable medium of claim 8, wherein the plurality of discretized trajectory state options are generated by discretizing a region of interest for the driving scenario in view of a final destination trajectory state.

11

11. The non-transitory machine-readable medium of claim 8, wherein the judgment score includes scores representing whether the trajectory ends at a planned destination state, the trajectory is smooth, and the trajectory avoids one or more obstacles of an environment model.

12

12. The non-transitory machine-readable medium of claim 8, wherein the driving scenario includes one or more regions of interest (ROIs).

13

13. The non-transitory machine-readable medium of claim 8, wherein the RL agent includes an actor neural network and a critic neural network, and wherein the actor and critic neural networks are deep neural networks.

14

14. The non-transitory machine-readable medium of claim 13, wherein the actor neural network includes a convolutional neural network.

16

16. The data processing system of claim 15, wherein the plurality of discretized control action options are generated based on a vehicle dynamic model for autonomous driving.

17

17. The data processing system of claim 15, wherein the plurality of discretized trajectory state options are generated by discretizing a region of interest for the driving scenario in view of a final destination trajectory state.

18

18. The data processing system of claim 15, wherein the judgment score includes scores representing whether the trajectory ends at a planned destination state, the trajectory is smooth, and the trajectory avoids one or more obstacles of an environment model.

19

19. The data processing system of claim 15, wherein the driving scenario includes one or more regions of interest (ROIs).

20

20. The data processing system of claim 15, wherein the RL agent includes an actor neural network and a critic neural network, and wherein the actor and critic neural networks are deep neural networks.

21

21. The data processing system of claim 20, wherein the actor neural network includes a convolutional neural network.

Patent Metadata

Filing Date

Unknown

Publication Date

November 8, 2022

Inventors

RUNXIN HE
JINYUN ZHOU
QI LUO
SHIYU SONG
JINGHAO MIAO
JIANGTAO HU
YU WANG
JIAXUAN XU
SHU JIANG

Want to explore more patents?

Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.

Citation & reuse

Analysis on this page is generated by Patentable — an AI-powered patent intelligence platform. AI-generated summaries, explanations, and analysis may be reused with attribution and a visible link back to the canonical URL below. Patent abstracts and claims are USPTO public domain.

Cite as: Patentable. “OFFLINE AGENT USING REINFORCEMENT LEARNING TO SPEEDUP TRAJECTORY PLANNING FOR AUTONOMOUS VEHICLES” (11493926). https://patentable.app/patents/11493926

© 2026 Patentable. All rights reserved.

Patentable is a research and drafting-assistant tool, not a law firm, and does not provide legal advice. Documents we generate are drafts for review by a licensed patent attorney.