Towards Spatial Perception: Learning to Locate Objects From Vision

  abstract =     "Our humanoid robot learns to estimate the positions
                 of objects placed on a table with centimetre-range
                 accuracy, even while it is moving its torso, head and
                 eyes. These estimates are provided by trained
                 artificial neural networks (ANN) and a Cartesian
                 genetic programming (GP) method, based solely on the
                 inputs from the two cameras and the joint encoder
                 positions. No prior camera calibration or kinematic
                 model is used. We find that ANN and GP are both able to
                 localise objects robustly even while the robot
                 is moving, with an accuracy comparable to current
                 techniques used on the iCub.",
