Subscribe: Frederick Giasson's Weblog
http://feeds.feedburner.com/FredOnSomething
Added By: Feedage Forager Feedage Grade A rated
Language: English
Tags:
cognonto  datasets  graph  kbpedia knowledge  kbpedia  knowledge graph  knowledge  learning  machine learning  machine  text  training 
Rate this Feed
Rate this feedRate this feedRate this feedRate this feedRate this feed
Rate this feed 1 starRate this feed 2 starRate this feed 3 starRate this feed 4 starRate this feed 5 star

Comments (0)

Feed Details and Statistics Feed Statistics
Preview: Frederick Giasson's Weblog

Frederick Giasson



Data Scientist & Software Developer



Published: 2017-01-24T20:26:46Z

 



Disambiguating KBpedia Knowledge Graph Concepts

2017-01-24T19:19:06Z

One of the most important natural language processing tasks is to “tag” concepts in text. Tagging a concept means determining whether words or phrases in a text document matches any of the concepts that exist in some kind of a knowledge structure (such as a knowledge graph, an ontology, a taxonomy, a vocabulary, etc.). (BTW, […]



Extended KBpedia With Wikipedia Categories

2017-01-10T20:10:36Z

A knowledge graph is an ever evolving structure. It needs to be extended to be able to cope with new kinds of knowledge; it needs to be fixed and improved in all kinds of different ways. It also needs to be linked to other sources of data and to other knowledge representations such as schemas, […]



Leveraging KBpedia Aspects To Generate Training Sets Automatically

2016-11-21T11:14:06Z

In previous articles I have covered multiple ways to create training corpuses for unsupervised learning and positive and negative training sets for supervised learning 1 , 2 , 3 using Cognonto and KBpedia. Different structures inherent to a knowledge graph like KBpedia can lead to quite different corpuses and sets. Each of these corpuses or […]



Dynamic Machine Learning Using the KBpedia Knowledge Graph – Part 2

2016-11-17T11:05:32Z

In the first part of this series we found the good hyperparameters for a single linear SVM classifier. In part 2, we will try another technique to improve the performance of the system: ensemble learning. So far, we already reached 95% of accuracy with some tweaking the hyperparameters and the training corpuses but the F1 […]



Dynamic Machine Learning Using the KBpedia Knowledge Graph – Part 1

2016-11-17T11:00:43Z

In my previous blog post, Create a Domain Text Classifier Using Cognonto, I explained how one can use the KBpedia Knowledge Graph to automatically create positive and negative training corpuses for different machine learning tasks. I explained how SVM classifiers could be trained and used to check if an input text belongs to the defined […]



Building and Maintaining the KBpedia Knowledge Graph

2016-11-07T19:57:15Z

The Cognonto demo is powered by an extensive knowledge graph called the KBpedia Knowledge Graph, as organized according to the KBpedia Knowledge Ontology (KKO). KBpedia is used for all kinds of tasks, some of which are demonstrated by the Cognonto use cases. KBpedia powers dataset linkage and mapping tools, machine learning training workflows, entity and […]



Literate [Clojure] Programming: Tangle All in Org-mode

2016-10-26T18:19:29Z

This blog post is the fifth of a series of blog posts about Literate [Clojure] Programming in Org-mode where I explain how I develop my [Clojure] applications using literate programming concepts and principles. This new blog post introduce a tool that is often necessary when developing literate applications using Org-mode: the tangle all script. As […]



Create a Domain Text Classifier Using Cognonto

2016-10-25T00:49:36Z

A common task required by systems that automatically analyze text is to classify an input text into one or multiple classes. A model needs to be created to scope the class (what belongs to it and what does not) and then a classification algorithm uses this model to classify an input text. Multiple classification algorithms […]



Mapping Datasets, Schema and Ontologies Using the Cognonto Mapper

2016-10-07T12:20:28Z

There are many situations were we want to link named entities from two different datasets or to find duplicate entities to remove in a single dataset. The same is true for vocabulary terms or ontology classes that we want to integrate and map together. Sometimes we want to use such a linkage system to help […]



Improving Machine Learning Tasks By Integrating Private Datasets

2016-10-04T15:00:29Z

In the last decade, we have seen the emergence of two big families of datasets: the public and the private ones. Invaluable public datasets like Wikipedia, Wikidata, Open Corporates and others have been created and leveraged by organizations world-wide. However, as great as they are, most organization still rely on private datasets of their own […]