Robotic Autofocus for Visual Inspection — Presentation Outline

Ratio-Based Velocity Control for Robotic Macro Imaging in Precision Manufacturing
Santoso, Acton, Back, Chen — MACS Lab, University of Washington

Target: ~15 min talk (~13 slides, ~1 min each). Bullets below summarize what to say per slide.

Slide 1 — Title & Motivation Hook

Robotic autofocus algorithm for visual inspection in precision manufacturing
Key claim: reaches optimal focus in < 25% of the time of exhaustive hill climb, while holding 2 mm accuracy on textured surfaces / 4 mm on smooth/reflective ones
Works across multiple focus metrics and 9 materials with no recalibration

System

Slide 2 — Why Automated Inspection?

Quality control is critical in aerospace / precision manufacturing: complex geometries, tight tolerances, variable composite materials
Manual inspection is error-prone — humans miss up to 25% of defects in enterprise-scale tasks
Automation is faster (< 40% of human inspection time), more accurate (+17%), and cheaper (~40% cost reduction per component)
High-mix, low-volume sectors (aerospace) have been slow to adopt — motivates this work

Slide 3 — The Autofocus Problem

Focus on visual inspection: low-cost, accessible; goal is consistently sharp images
Core challenge: maintain accurate focus while preserving constant magnification
Cannot use in-lens autofocus — it changes focal length → changes magnification → corrupts geometric data for downstream ML/defect detection
Solution: focus by moving the robot arm, not the lens
Shallow depth of field on a macro lens makes reliable autofocus essential

Experimental setup with focus curves along principal axis

Slide 4 — Why Not Standard Optimization?

Focus Value (FV) curve is non-smooth: extensive flat regions + a narrow peak
Newton’s method: noise-sensitive, needs a good initial guess, fails in flat/non-smooth regions; no analytic gradient/Hessian for FV
Bayesian optimization: assumes smoothness, needs per-object/environment tuning, struggles with flat regions and narrow peaks
Neither delivers real-time, robust autofocus across diverse inspection scenarios

Slide 5 — Our Approach & Contributions

A data-driven, ratio-based adaptive autofocus framework
(i) Novel ratio-based robotic autofocus that works across multiple focus metrics & materials without recalibration
(ii) Comprehensive evaluation of focus metrics under consistent conditions across materials
(iii) High-precision demonstration: sub-mm on textured, 4 mm on smooth surfaces
Repurposes ratio control (from mixing applications) — uses the ratio of focus values as the control input, so it’s metric-scale-agnostic

Autofocus control loop with ratio-based velocity feedback

Slide 6 — Focus Value & The Three Metrics

FV = image sharpness from high-frequency edge content; plotted vs. focal distance → bell curve, peak = optimal focus
Variance of Sobel — robust to Gaussian noise, but lower sensitivity to fine edges
Squared Gradient (SG) — more edge-sensitive, but can amplify noise
FSWM (frequency-selective weighted median) — strong noise robustness + edge preservation, but computationally expensive
Different strengths → motivates a system that adapts across metrics
$F V = \sum_{x, y} (G_{x}^{2} + G_{y}^{2})$ for SG; FSWM uses weighted-median filtering then variance

Slide 7 — The Core Problem: Metrics Vary Wildly in Scale

FV magnitude depends on surface texture (rough steel ≫ smooth carbon fiber) and on the chosen metric
Direct velocity control on raw FV → constant recalibration; impractical
Figure: FV across FSWM / Sobel / SG on steel spans orders of magnitude (note: log scale)

Log-scale FV comparison across metrics

Slide 8 — The Ratio Insight

Instead of raw FV, use the sign of derivatives + a ratio formulation:
- $\nabla F V [t] = F V [t] - F V [t - 1]$
- $\nabla^{2} F V [t] = \nabla F V [t] - \nabla F V [t - 1]$
- $r [t] = F V [t] / F V [t - 1]$
Derivative signs are consistent across metrics → reliable coarse/fine mode switching
The ratio compresses the dynamic range to a narrow band (r < 10) → one set of parameters works across metrics & materials

Ratio compresses dynamic range across metrics

FV, first/second derivative, and ratio vs. position (steel, Sobel)

Slide 9 — The Control Algorithm

Nonlinear velocity control (not discrete position steps) → smooth, responsive robot motion
Three phases:
- Coarse mode ( $\nabla F V > 0, \nabla^{2} F V > 0$ ): $v = k_{v} / r [t]$ → low ratio far away = high velocity, fast sweep
- Fine mode ( $\nabla F V > 0, \nabla^{2} F V < 0$ ): $v = k_{v} r [t]$ → velocity decreases near peak for precision
- Stop / final correction ( $\nabla F V = 0$ ): settle on stored max-FV position, correct overshoot
$k_{v}$ = single unitless gain balancing speed vs. safety
Starts unidirectional from the far end of the FV curve

Autofocus control-logic flowchart

Slide 10 — Handling Noise: DEMA Smoothing

Surface variation, robot motion, illumination, and sensor noise corrupt raw FV → noisy ratio/derivatives
Apply Double Exponential Moving Average (DEMA): $F V = 2 \overline{F V} - \overline{\overline{F V}}$
Reduces lag vs. plain EMA, preserves the location of the optimal FV
Velocity profiles stay consistent across metrics without retuning

Velocity profiles across metrics on steel

Raw vs. DEMA-filtered FV (steel, FSWM)

Slide 11 — Experimental Setup

UR5e manipulator + eye-in-hand imaging: Sony IMX477 sensor, macro lens, addressable LED ring
Magnification 0.18×, ~30 cm focus distance, 30 fps @ 1080p
ROS2 + MoveIt2 Servo converts real-time velocity commands → joint trajectories
9 materials spanning texture/reflectivity: carbon fiber, unpolished steel, PCB, breadboard, foam, PLA, wood, LED strip, solder board
Benchmark: Exhaustive Hill Climb (EHC) at constant velocity; both use identical fps, start position, distance; ratio $k_{v} = 0.4$ , EHC at half

Test coupons: 9 materials

Slide 12 — Results

Time: > 75% improvement over EHC across all materials and all metrics (~5–6 s vs. ~28–30 s)
Accuracy:
- Within 2 mm: breadboard, carbon fiber, foam, PLA (near robot’s absolute position limit)
- Within 4 mm: LED strip, PCB, steel, wood, solder board
- Max error: FSWM 1.99 mm, Sobel 3.69 mm, SG 2.97 mm; averages 0.85 / 1.56 / 1.32 mm
Ratio bound: $r [t] < 10$ across everything → compresses FV range by > 90%

FV, ratio, and velocity for carbon fiber / steel / PCB (Sobel)

Slide 13 — Impact of Material Characteristics

Convergence speed is largely insensitive to surface appearance (uniform >75% time gain)
Accuracy depends more on surface finish than on metric choice:
- Reflective / low-texture (steel, LED strip, solder, wood) → weaker gradients → larger error
- Strong intensity variation (carbon fiber) → stronger gradients → better accuracy
- Textured + low-reflectivity (breadboard, foam, PLA) → clearest focus ratios
DEMA keeps performance robust despite noise; ratio bounds instability across huge FV swings (e.g. SG FV spans 278 → 2.8×10⁵, yet r < 10)

Slide 14 — Conclusion & Future Work

Ratio-based autofocus reliably works across multiple focus metrics and diverse materials with no recalibration
Compressing FV into a bounded ratio is the key to robustness in dynamic inspection environments
Validated on real hardware (UR5e + IMX477 + macro lens) under realistic cycle-time/reachability limits
Future work: noise reduction for smoother ratio & tunable velocity profiles — especially on smooth/specular surfaces
Thank you / Questions

Figure inventory (from the paper)

Slide	File	Paper caption gist
1, 3	`new_system_image.png`	UR5e + IMX477 + macro lens setup, FV curves along principal axis
5	`algo_render.png`	Autofocus control loop; 4-region velocity response
7	`fv_comparison_log.png`	Log-scale FV across FSWM/Sobel/SG on steel
8	`ratio_comparison.png`	Ratio narrows dynamic range across metrics
8	`FV_dFV_ddFV.png`	FV, ∇FV, ∇²FV, ratio vs. position
9	`ratio_paper.png`	Control-logic flowchart
10	`smoothed_vel.png`	Velocity profiles across metrics
10	`fv_triplet.png`	Raw vs. DEMA-filtered FV
11	`coupon.PNG`	9 test materials
12	`results_sobel.png`	FV / ratio / velocity for CF, steel, PCB

Tables (not images): Table I time performance, Table II position accuracy, Table III max ratio vs. FV — recreate as slide tables or pull the headline numbers (Slide 12).

Colin's Vault

Explorer

Presentation-Outline