Here's one way to extract frequently-mentioned noun chunks from a document:

If you run that on the London Wikipedia article, you'll get output like this: westminster abbey, natural history museum, west end, east end, st paul's cathedral, royal albert hall, london underground, great fire, british.

(Source: Wikipedia article "London") This paragraph contains several useful facts.

But parsing word dependencies is a particularly complex task and would require an entire article to explain in any detail.
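
The noun-chunk extraction mentioned above can be sketched roughly like this. This is a minimal sketch, not necessarily the code the article originally used: it assumes spaCy is installed along with the small English model `en_core_web_sm`, and the helper names `frequent_chunks` and `noun_chunks_from_text`, as well as the `min_count` threshold, are made up for illustration.

```python
from collections import Counter


def frequent_chunks(chunks, min_count=3):
    """Return lowercased chunks seen at least min_count times, most frequent first."""
    counts = Counter(chunk.lower().strip() for chunk in chunks)
    return [chunk for chunk, n in counts.most_common() if n >= min_count]


def noun_chunks_from_text(text, model="en_core_web_sm"):
    """Extract multi-word noun chunks from raw text with spaCy.

    Assumption: the named model is installed. spaCy is imported here so the
    counting helper above can be used without it.
    """
    import spacy

    nlp = spacy.load(model)
    doc = nlp(text)
    # doc.noun_chunks yields base noun phrases; keep only those with 2+ tokens
    return [chunk.text for chunk in doc.noun_chunks if len(chunk) >= 2]
```

With the text of an article in a string `text`, calling `frequent_chunks(noun_chunks_from_text(text))` would give a list of repeated phrases similar in spirit to the output shown above, though the exact phrases depend on the model and the threshold.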

But with NLP, it's a breeze.

Step 6: Dependency Parsing

The next step is to figure out how all the words in our sentence relate to each other.

There's another great interactive demo from spaCy here.

Or if you aren't a Python user and end up using a different NLP library, the ideas should all work roughly the same way.

If we parse this with our NLP pipeline, we'll know that "it" was founded by Romans. But our NLP model doesn't know what pronouns mean because it only examines one sentence at a time. Coreference resolution is one of the most difficult steps in our pipeline to implement.
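
To make the dependency parsing step a little more concrete, here is a small sketch of how you might list each word's relation to its parent word using spaCy. It assumes spaCy and the `en_core_web_sm` model are installed; the helper names `dependency_arcs` and `parse` are hypothetical. Because spaCy parses each sentence on its own terms, nothing here links a pronoun like "it" back to the noun it stands for, which is the gap coreference resolution is meant to fill.

```python
def dependency_arcs(doc):
    """Return (word, relation, parent word) for every token in a parsed doc.

    Works on any sequence of objects exposing spaCy's token attributes
    .text, .dep_ and .head. In spaCy, the root token is its own head.
    """
    return [(token.text, token.dep_, token.head.text) for token in doc]


def parse(text, model="en_core_web_sm"):
    """Parse raw text with spaCy (assumed installed) and list its dependency arcs."""
    import spacy

    nlp = spacy.load(model)
    return dependency_arcs(nlp(text))


def print_arcs(arcs):
    """Print one arc per line, e.g. 'London --nsubj--> is'."""
    for word, relation, parent in arcs:
        print(f"{word} --{relation}--> {parent}")
```

Calling `print_arcs(parse("London is the capital of England."))` would print one line per word; the exact labels depend on the model version.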

