{"id":141683,"date":"2014-01-31T08:36:30","date_gmt":"2014-01-31T08:36:30","guid":{"rendered":"https:\/\/staging2.simonw59.sg-host.com\/context-exploiting-linked-data-for-content-analysis\/"},"modified":"2022-05-26T13:42:08","modified_gmt":"2022-05-26T13:42:08","slug":"context-exploiting-linked-data-for-content-analysis","status":"publish","type":"post","link":"https:\/\/force11.org\/post\/context-exploiting-linked-data-for-content-analysis\/","title":{"rendered":"conTEXT: Exploiting Linked Data for Content Analysis"},"content":{"rendered":"<div class=\"ct_body\">\n<p><a href=\"http:\/\/context.aksw.org\">conTEXT <\/a>is a platform for lightweight text analytics. It allows to semantically analyze text corpora (such as blogs, RSS\/Atom feeds, Facebook, G+, Twitter or <a href=\"http:\/\/slidewiki.org\">SlideWiki.org<\/a> decks) and provides novel ways for <span class=\"text-success\">browsing<\/span> and <span class=\"text-warning\">visualizing<\/span> the results. Furthermore, conTEXT allows researchers to provide a semantic overview of their written text in terms of the concepts and their relations in the text.<\/p>\n<p style=\"text-align: center\"><img decoding=\"async\" alt=\"conTEXT workflow\" src=\"http:\/\/blog.aksw.org\/wp-content\/uploads\/2014\/01\/workflow.jpg\" title=\"workflow\" \/><\/p>\n<p>The process of text analytics in conTEXT starts by collecting information from the web. conTEXT utilizes standard information access methods and protocols such as RSS\/ATOM feeds, SPARQL endpoints and REST APIs as well as customized crawlers for WordPress and Blogger to build a corpus of information relevant for a certain user. The assembled text corpus is then processed by Natural Language Processing (NLP) services (currently <a href=\"http:\/\/aksw.org\/Projects\/FOX.html\">FOX <\/a>and <a href=\"http:\/\/spotlight.dbpedia.org\/\">DBpedia-Spotlight<\/a>) which link unstructured information sources to the Linked Open Data cloud through DBpedia. The processed corpus is then further enriched by de-referencing the&nbsp; DBpedia URIs as well as&nbsp; matching with pre-defined natural-language patterns for DBpedia predicates (<a href=\"http:\/\/aksw.org\/Projects\/BOA.html\">BOA patterns<\/a>). The processed data can also be joined with other existing corpora in a <em>text analytics mashup<\/em>. The creation of analytics mashups requires dealing with the heterogeneity of different corpora as well as the heterogeneity of different NLP services utilized for annotation. conTEXT employs <a href=\"http:\/\/nlp2rdf.org\/\">NIF <\/a>(NLP Interchange Format) to deal with this heterogeneity. The processed, enriched and possibly mixed results are presented to users using different views for exploration and visualization of the data. Additionally, conTEXT provides an annotation refinement user interface based on the RDFa Content Editor (<a href=\"http:\/\/rdface.aksw.org\/\">RDFaCE<\/a>) to enable users to revise the annotated results. User-refined annotations are sent back to the NLP services as feedback for the purpose of learning in the system.<\/p>\n<p>For more information on conTEXT visit:<a href=\"http:\/\/context.aksw.org\/\"><img decoding=\"async\" alt=\"\" class=\"size-full wp-image-1376 alignright\" height=\"63\" src=\"http:\/\/blog.aksw.org\/wp-content\/uploads\/2014\/01\/logo.png\" style=\"float: right\" title=\"logo\" width=\"207\" \/><\/a><\/p>\n<ul>\n<li>\n\t\tOnline demo:&nbsp; <a href=\"http:\/\/context.aksw.org\">http:\/\/context.aksw.org<\/a><\/li>\n<li>\n\t\tScreencast: <a href=\"http:\/\/youtu.be\/EiGdkDRu_Ew\">http:\/\/youtu.be\/EiGdkDRu_Ew<\/a><\/li>\n<li>\n\t\tSome examples of analyzed corpora:&nbsp; <a href=\"http:\/\/context.aksw.org\/app\/hub.php?corpus=8\">CNN<\/a>, <a href=\"http:\/\/context.aksw.org\/app\/hub.php?corpus=123\">BBC<\/a>, <a href=\"http:\/\/context.aksw.org\/app\/hub.php?corpus=7\">AKSW<\/a>, <a href=\"http:\/\/context.aksw.org\/app\/hub.php?corpus=6\">LOD2<\/a> blogs or tweets from <a href=\"http:\/\/context.aksw.org\/app\/hub.php?corpus=13\">Bill Gates<\/a>, <a href=\"http:\/\/context.aksw.org\/app\/hub.php?corpus=4\">Barack Obama<\/a>, <a href=\"http:\/\/context.aksw.org\/app\/hub.php?corpus=151\">Ali Khalili<\/a> or <a href=\"http:\/\/context.aksw.org\/app\/hub.php?corpus=120\">S&ouml;ren Auer<\/a><\/li>\n<li>\n\t\tPublication: <em>Ali Khalili, S&ouml;ren Auer, Axel-Cyrille Ngonga Ngomo<\/em>: <a href=\"http:\/\/svn.aksw.org\/papers\/2014\/ESWC_conTEXT\/public.pdf\">conTEXT &ndash; Lightweight Text Analytics using Linked Data<\/a><\/li>\n<\/ul>\n<\/div>\n<p class=\"ct_meta\"><span class=\"ct_label\">Archive:<\/span>&nbsp;<a class=\"ct_archive\" target=\"_blank\" href=\"https:\/\/archive.force11.net\/node\/4852\" rel=\"noopener\">https:\/\/archive.force11.net\/node\/4852<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>conTEXT is a platform for lightweight text analytics. It allows to semantically analyze text corpora (such as blogs, RSS\/Atom feeds, Facebook, G+, Twitter or SlideWiki.org decks) and provides novel ways for browsing and visualizing the results. Furthermore, conTEXT allows researchers to provide a semantic overview of their written text in terms of the concepts and &#8230; <a title=\"conTEXT: Exploiting Linked Data for Content Analysis\" class=\"read-more\" href=\"https:\/\/force11.org\/post\/context-exploiting-linked-data-for-content-analysis\/\" aria-label=\"More on conTEXT: Exploiting Linked Data for Content Analysis\">Read more<\/a><\/p>\n","protected":false},"author":206182,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"advgb_blocks_editor_width":"","advgb_blocks_columns_visual_guide":"","footnotes":""},"categories":[182],"tags":[],"force11":[],"blog_series":[],"working_group":[],"class_list":["post-141683","post","type-post","status-publish","format-standard","hentry","category-blog"],"acf":[],"author_meta":{"display_name":"Ali Khalili","author_link":"\/members\/ali1k"},"featured_img":null,"coauthors":[],"tax_additional":{"categories":{"linked":["<a href=\"https:\/\/force11.org\/category\/blog\/\" class=\"advgb-post-tax-term\">Blogs<\/a>"],"unlinked":["<span class=\"advgb-post-tax-term\">Blogs<\/span>"]}},"comment_count":"0","relative_dates":{"created":"Posted 12 years ago","modified":"Updated 4 years ago"},"absolute_dates":{"created":"Posted on 31 Jan 2014","modified":"Updated on 26 May 2022"},"absolute_dates_time":{"created":"Posted on 31 Jan 2014 08:36","modified":"Updated on 26 May 2022 13:42"},"featured_img_caption":"","series_order":"","_links":{"self":[{"href":"https:\/\/force11.org\/wp-json\/wp\/v2\/posts\/141683","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/force11.org\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/force11.org\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/force11.org\/wp-json\/wp\/v2\/users\/206182"}],"replies":[{"embeddable":true,"href":"https:\/\/force11.org\/wp-json\/wp\/v2\/comments?post=141683"}],"version-history":[{"count":0,"href":"https:\/\/force11.org\/wp-json\/wp\/v2\/posts\/141683\/revisions"}],"wp:attachment":[{"href":"https:\/\/force11.org\/wp-json\/wp\/v2\/media?parent=141683"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/force11.org\/wp-json\/wp\/v2\/categories?post=141683"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/force11.org\/wp-json\/wp\/v2\/tags?post=141683"},{"taxonomy":"force11","embeddable":true,"href":"https:\/\/force11.org\/wp-json\/wp\/v2\/force11?post=141683"},{"taxonomy":"blog_series","embeddable":true,"href":"https:\/\/force11.org\/wp-json\/wp\/v2\/blog_series?post=141683"},{"taxonomy":"working_group","embeddable":true,"href":"https:\/\/force11.org\/wp-json\/wp\/v2\/working_group?post=141683"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}