Paradigms Of Evaluation In Natural Language Processing: Field Linguistics For Glass Box Testing