< Back to list
Extracted Abstract:
—This study proposes to construct the Great Wall knowledge graph (GWKG) in a semi-automatic way. First, under the guidance of domain experts, we build the professional dictio- nary and ontology layer, where BERT is applied for Named Entity Recognition. The resulting entities are clustered by Word2Vec to automatically refine the ontology layer. Then, we conduct Relation Extraction based on semi-supervision, link entities to encyclopedia websites, and obtain semi-structured information by crawler technology for attribute filling. Finally, we visualize the GWKG and report the results of multiple data formats. The GWKG consists of 34 ontology concepts, 33 types of relations, more than 6,000 entities and attributes, and 720,000 Chinese corpora. Index Terms—knowledge graph, the Great Wall, ontology, semi-automatic I.