Creating graphs from a corpus of documents
In this section, we will use the information we extracted in the previous section using the different text engines to build networks that relate the different information. In particular, we will focus on two kinds of graphs:
- Knowledge-based graphs, where we will use the semantic meaning of sentences to infer relationships between the different entities.
- Bipartite graphs, where we will be connecting the documents to the entities that appear in the text. We will then project the bipartite graph into a homogeneous graph, which will be made up of either document or entity nodes only.
Knowledge graphs
Knowledge graphs are very interesting as they not only relate entities but also provide a direction and a meaning to the relationship. For instance, let's take a look at the following relationship:
I (->) buy (->) a book
This is substantially...