Leveraging distributional context for safe and interactive data science at scale