Framework for testing the performance of parsing and evaluation.