Problem
The client needed to compile a database of venture capital (VC) activity to support their product. However, the details of each deal are buried in news articles, which are unstructured, written in different styles by different authors, and often packed with information irrelevant to a high-level overview.