How do you test a program when only a single user, with no expertise in software testing, is able to determine if the
program is performing correctly? Such programs are common today in the form of machine-learned classifiers. We consider the
problem of testing this common kind of machine-generated program...