Evaluating and testing AI-infused software poses a new challenge. Traditional software testing assumes a predictable system whose outputs can be checked against known expected results. AI systems, by contrast, can be unpredictable and uncertain in their outputs, which makes their reliability harder to verify and creates risk for AI products. This session offers insights into the challenges, best practices and focus areas that help AI teams gain confidence in the reliability of their AI and ML applications.