From Web to World: Harnessing Foundation Models for Intelligent Robotic Assistants in Real-World Environments