You are assigned ownership of detecting and tracking (6 DoF) 3d objects in a scene observed from lidar, visual and radar sensor modalities mounted on top of an excavator. Describe the system you would build to ensure both accuracy and robustness in broad strokes.✱