Subscribe: Frederick Giasson's Weblog
http://feeds.feedburner.com/FredOnSomething
Added By: Feedage Forager Feedage Grade B rated
Language: English
Tags:
cognonto  corpuses  datasets  graph  kbpedia knowledge  kbpedia  knowledge graph  knowledge  learning  machine learning  text  training 

Preview: Frederick Giasson's Weblog

Frederick Giasson



Data Scientist & Software Developer



Published: 2016-11-21T11:14:06Z

 



Leveraging KBpedia Aspects To Generate Training Sets Automatically

2016-11-21T11:14:06Z

In previous articles I have covered multiple ways to create training corpuses for unsupervised learning and positive and negative training sets for supervised learning [1, 2, 3] using Cognonto and KBpedia. Different structures inherent to a knowledge graph like KBpedia can lead to quite different corpuses and sets. Each of these corpuses or […]



Dynamic Machine Learning Using the KBpedia Knowledge Graph – Part 2

2016-11-17T11:05:32Z

In the first part of this series we found good hyperparameters for a single linear SVM classifier. In part 2, we will try another technique to improve the performance of the system: ensemble learning. So far, we have already reached 95% accuracy by tweaking the hyperparameters and the training corpuses, but the F1 […]



Dynamic Machine Learning Using the KBpedia Knowledge Graph – Part 1

2016-11-17T11:00:43Z

In my previous blog post, Create a Domain Text Classifier Using Cognonto, I explained how one can use the KBpedia Knowledge Graph to automatically create positive and negative training corpuses for different machine learning tasks. I explained how SVM classifiers could be trained and used to check if an input text belongs to the defined […]



Building and Maintaining the KBpedia Knowledge Graph

2016-11-07T19:57:15Z

The Cognonto demo is powered by an extensive knowledge graph called the KBpedia Knowledge Graph, which is organized according to the KBpedia Knowledge Ontology (KKO). KBpedia is used for all kinds of tasks, some of which are demonstrated by the Cognonto use cases. KBpedia powers dataset linkage and mapping tools, machine learning training workflows, entity and […]



Literate [Clojure] Programming: Tangle All in Org-mode

2016-10-26T18:19:29Z

This blog post is the fifth in a series about Literate [Clojure] Programming in Org-mode where I explain how I develop my [Clojure] applications using literate programming concepts and principles. This new blog post introduces a tool that is often necessary when developing literate applications using Org-mode: the tangle all script. As […]



Create a Domain Text Classifier Using Cognonto

2016-10-25T00:49:36Z

A common task required by systems that automatically analyze text is to classify an input text into one or multiple classes. A model needs to be created to scope the class (what belongs to it and what does not) and then a classification algorithm uses this model to classify an input text. Multiple classification algorithms […]
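The two steps described in this excerpt (build a model that scopes the class, then let an algorithm use that model to classify a text) can be sketched as follows. This is only an illustration and not the approach from the post: it swaps in a tiny naive Bayes classifier written with the Python standard library, and the function names (`tokenize`, `train`, `classify`) and the example documents are hypothetical.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it on whitespace (a deliberately naive tokenizer)."""
    return text.lower().split()


def train(positive_docs, negative_docs):
    """Build the model: per-class word counts, the shared vocabulary, and document counts."""
    counts = {"positive": Counter(), "negative": Counter()}
    for doc in positive_docs:
        counts["positive"].update(tokenize(doc))
    for doc in negative_docs:
        counts["negative"].update(tokenize(doc))
    vocab = set(counts["positive"]) | set(counts["negative"])
    doc_counts = {"positive": len(positive_docs), "negative": len(negative_docs)}
    return {"counts": counts, "vocab": vocab, "doc_counts": doc_counts}


def classify(model, text):
    """Return the label with the highest naive Bayes log score (add-one smoothing)."""
    total_docs = sum(model["doc_counts"].values())
    vocab_size = len(model["vocab"])
    scores = {}
    for label, counter in model["counts"].items():
        total_words = sum(counter.values())
        score = math.log(model["doc_counts"][label] / total_docs)
        for token in tokenize(text):
            if token in model["vocab"]:  # words never seen in training are ignored
                score += math.log((counter[token] + 1) / (total_words + vocab_size))
        scores[label] = score
    return max(scores, key=scores.get)


# Hypothetical training sets: "positive" is the music domain, "negative" is everything else.
model = train(
    positive_docs=["guitar piano music concert", "music band album song"],
    negative_docs=["stock market finance bank", "bank loan interest finance"],
)
print(classify(model, "a new album by the band"))
print(classify(model, "bank interest rates"))
```

The positive and negative training sets play the role of the model that scopes the class; any other classification algorithm could be substituted for the scoring step.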



Mapping Datasets, Schema and Ontologies Using the Cognonto Mapper

2016-10-07T12:20:28Z

There are many situations where we want to link named entities from two different datasets or to find duplicate entities to remove in a single dataset. The same is true for vocabulary terms or ontology classes that we want to integrate and map together. Sometimes we want to use such a linkage system to help […]



Improving Machine Learning Tasks By Integrating Private Datasets

2016-10-04T15:00:29Z

In the last decade, we have seen the emergence of two big families of datasets: the public and the private ones. Invaluable public datasets like Wikipedia, Wikidata, Open Corporates and others have been created and leveraged by organizations world-wide. However, as great as they are, most organizations still rely on private datasets of their own […]



Using Cognonto to Generate Domain Specific word2vec Models

2016-09-28T19:27:28Z

word2vec is a two-layer artificial neural network that processes text to learn the relationships between the words of a text corpus and to build a model of all those relationships. The text corpus that a word2vec process uses to learn the relationships between words is called the training corpus. In […]



Web Page Analysis With Cognonto

2016-09-23T17:48:31Z

Extract Structured Content, Tag Concepts & Entities

Cognonto is brand new. At its core, it uses a structure of nearly 40,000 concepts. It has about 138,000 links to external classes and concepts that define huge public datasets such as Wikipedia, DBpedia and USPTO. Cognonto is not a children’s toy. It is huge and […]