SKIPP — End-to-end Sketch-Guided Path Planning through Imitation Learning

Abstract

Path planning is crucial for Autonomous Mobile Robots applications. Traditionally, path planning based on human input and preferences has relied on hard-to-define reward-based learning or costly techniques requiring additional hardware. This work introduces a more accessible and flexible approach through sketch-guided imitation learning, where nontechnical users can simply draw the desired navigational path on a provided 2D map, which is then used to teach U-net models path planning behaviors. Additionally, the work draws on metrics from the fields of image generation and robotics to provide a novel evaluation framework. The approach is integrated into an end-to-end robotics stack to demonstrate its usability.

Image Generation Local Path Planning Deep Learning U-Net Imitation Learning

Proposed Method

SKIPP frames path planning as an imitation learning problem. Instead of relying on complex reward engineering, users draw the desired navigation behavior directly on a 2D map. These sketches are post-processed and used to train a lightweight U-Net model through Behavioral Cloning. The trained model is integrated within NVIDIA's ISAAC SDK for real-time end-to-end navigation.

Input Encoding

Each input instance is a multi-channel image encoding:

Occupancy grid — obstacles in white, navigable areas in grey, unobserved in black
Goal position — overlayed as a blue rectangle
Robot position — encoded as a red rectangle
Expert path — hand-drawn sketch processed and overlayed in green

Input encoding: occupancy grid with start, goal, and expert path overlayed.

Model Architecture

The network uses a U-Net architecture with an encoder-decoder structure and skip connections. The encoder progressively downsamples the input while capturing contextual features. The decoder restores fine-grained spatial details. A Sigmoid activation produces a binary mask representing the planned path. The model is trained with Binary Cross Entropy loss.

SKIPP Model Architecture — encoder-decoder with skip connections.

Expert Behaviors

The dataset is divided into two behaviors commonly demanded in industrial AMR use cases. The annotation tool allows experts to intuitively draw desired navigation paths which are then post-processed (thinning, DBSCAN clustering, spline smoothing) into actionable trajectories.

L-shape behavior

U-shape behavior

Results

Monte Carlo Cross-Validation with k=50 random seeds was used to evaluate stability. The model was trained for 160 epochs with batch size 32 using Adam optimizer on an NVIDIA Quadro RTX 8000.

Behavior	FID ↓		Mean-APE (m) ↓		RMSE-ATE (m) ↓		Inference (ms) ↓
Behavior	mean	std	mean	std	mean	std	mean	std
L-shape	50.25	7.11	0.050	0.014	0.055	0.014	5.28	0.09
U-shape	44.00	7.00	0.053	0.015	0.059	0.016	5.32	0.10

47.13

Avg. FID Score

High similarity to expert paths

0.051m

Mean APE

Positional accuracy

0.055m

RMSE-ATE

Trajectory deviation

5.3ms

Inference Time

Real-time capable

FID Quality Comparison

FID 254 — Poor

FID 47 — Good

Trajectory Accuracy (APE Analysis)

Color-coded trajectory error along the navigation path compared to expert data (dashed lines). Both examples show very low mean APE values with minimal standard deviation, indicating robust and accurate path generation.

L-shape trajectory — APE 0.017m

U-shape trajectory — APE 0.007m

Generation Examples

Citation

@INPROCEEDINGS{10924509,
  author={Rizk, Anthony and Abi Hana, Charbel and Bakouny, Youssef and Khatounian, Flavia},
  booktitle={2024 IEEE International Conference on Mechatronics and Automation (ICMA)}, 
  title={End-to-end Sketch-Guided Path Planning through Imitation Learning for Autonomous Mobile Robots}, 
  year={2024},
  pages={777-782},
  doi={10.1109/ICMA61710.2024.10924509}
}

End-to-end Sketch-Guided Path Planning through Imitation Learning for Autonomous Mobile Robots

Abstract

Proposed Method

Input Encoding

Model Architecture

Expert Behaviors

Results

FID Quality Comparison

Trajectory Accuracy (APE Analysis)

Generation Examples

Resources

GitHub Repository

Expert Dataset

SKIPP U-Shape Model

SKIPP L-Shape Model

Citation