The geometric and semantic scene information forms the basis of many computer vision applications, such as autonomous driving, service/home robotics, video surveillance, that require to automatically make the critical decisions based on this information, Hence,
an accurate estimation of both geometric and semantic information becomes essential because inaccurate visual information may lead to a catastrophic failure .