Category Archives: Semantic Publishing

Posts about semantic publishing.

Communication during the Coronavirus (I): Thematic analysis in Spanish digital news media

While it is obvious that the priority during this pandemic is to cure the sick, to prevent new cases from surfacing and to ensure there are economic and social measures in place to help the people and businesses most afflicted overcome the current situation; without a doubt, in the near future, the analysis of content related to the coronavirus that has been generated by the media and social network users will be the object of research for numerous disciplines such as sociology, philology, linguistics, audio-visual communication, and politics, to name a few.

At MeaningCloud we want to do our bit in this area, by applying our experience and our Text Analytics solutions to analyze the enormous volume of information in natural language, in Spanish and in other languages, in Spain and in other countries, given that, unfortunately, this is a global crisis.

This first article in the series centers on the thematic analysis of content that has been generated in Spanish by digital media platforms in Spain over the last month, how it has evolved during this period of time and the informative positioning of the main media platforms in Spain.

These other articles (only available, at the moment, in Spanish) analyse conversation topics on Twitter in Spain (both from the hashtags and general topics perspective and also applying a specific thematic categorization) and the linguistic analysis of presidential speeches related to this crisis.

Continue reading


Case Study: Text Analytics against Fake News

Everybody has heard about fake news. Fake news is a neologism that can be formally defined as a type of yellow journalism or propaganda that consists of deliberate disinformation or hoaxes spread via traditional print and broadcast news media or online social media. It is also commonly used to refer to fabricated or junk news, with no basis in fact, but presented as being factually accurate.

The reason for putting someone’s efforts in creating fake news is mainly to cause financial, political or reputational damage to people, companies or organizations, using sensationalist, dishonest, or outright fabricated headlines to increase readership and dissemination among readers using viralization. In addition, clickbait stories, a special type of fake news, earn direct advertising revenue from this activity.

Continue reading


Books Are a Service

Semantic Publishing and Voice of the Customer understanding for the media&content industry

The reason for publishing being a key industry to take advantage of text analytics is also the reason why the industry finds it so hard to engage with the technology.

Books are a serviceThe reason? Text. And a lot of it. The publishing world has struggled to understand how data relates to text and understand the value of data. This is changing, too slow for many, as the industry moves from seeing themselves as a ‘product’ based company (e.g. making books, e-books or physical) to a ‘service’ based company. In other words smart publishers are starting to see their service to customers as the creator and curator of information. This content is abled to be mixed and mashed-up in dynamic ways across a number of formats. This service is not bound, saddle-stitch or otherwise, to a specific product. This 180-degree perspective change requires publishers to think more directly about customer experience in the same way more traditional service based industries like hospitality or even retail banking.

Continue reading


Text Analytics for Publishing: there’s metadata and smarter metadata

Everyone agrees metadata is great. It helps simplify the management and packaging of content and data. It creates consistency and provenance of your content and data across an organization. Metadata gives you that 35000 feet perspective that is needed to make strategic decisions. This is especially important for publishers whose stock in trade is human language, which is completely opaque to machines whose world consists of zeros and ones. Your customers aren’t calling or emailing you to know what is in such and such database. No. They are contacting you because they want to know what monographs you have by such and such professor or asking you for all the archival material on ‘cats’, ‘World War 2’ or ‘nanotubes’. As a human, you understand exactly what they are looking for. If your ICT has a smidgeon of metadata, you can dig around that such-and-such database and deliver the content and have a happy customer.

Intelligent content for Semantic Publishing

Metadata TagMetadata makes your content more intelligent. That’s why everyone agrees metadata is great. Great until they have to either enter the metadata or maintain the vocabularies. Some organizations are lucky. They have ensured there is support within the workflow and people with the expertise to do the hard work so when that customer searches on the website, they quickly find what they are looking for and go away happy. But, even those lucky few do not live in isolation. There is no publisher of consequence who doesn’t have do deal with 3rd party content and data. A huge amount of additional effort is spent shoehorning 3rd party content into the metadata models of the organization. Every publisher has a workflow that includes completely throwing away existing metadata and spending additional time and wasteful effort to add metadata that their CMS can handle. Does that sound familiar? Does it feel better to know you aren’t the only one?

Continue reading


#ILovePolitics: Political discourse analysis in social media

We continue with the #ILovePolitics series of tutorials! We will show how to use MeaningCloud for extracting interesting insights to build your own Political Intel Reports and, at the same price, turning you into a Data Scientist giant in the field of Social Media Analytics.

political issues

Political issues

Politics and Social Media Analytics

Our research objective is to study and compare the discourse of different politicians during the electoral campaign, using their messages in Twitter. We are going to compare tweets by the four most popular (mentioned) politicians in our previous tutorial: Barack Obama (@barackobama), Hillary Clinton (@HillaryClinton), Donald Trump (@realDonaldTrump) and Jeb Bush (@JebBush).

  • What are their key messages?
  • What do they focus on?
  • Are really there different ways of doing politics?

Before we start, three remarks: 1) we will focus on U.S. Politics, in English language, but the same analysis can be adapted for your own country or language as long as it is supported in MeaningCloud, 2) this is a technical tutorial: we will develop some coding, but in general, everyone can understand the purpose of this tutorial, and 3) although this tutorial will use PHP, any non-rookie programmer can translate the programs to any language.

Continue reading


#ILovePolitics: Popularity analysis in the news

If you love politics, regardless of your party or political orientation, you may know that election periods are exciting moments and having good information is a must to increase the fun. This is why you follow the news, watch or listen to political analysis programs on TV or radio, read surveys or compare different points of view from one or the other side.

American politics in a nutshell

American politics

Starting with this, we are publishing a series of tutorials where we will show how to use MeaningCloud for extracting interesting political insights to build your own political intel reports. MeaningCloud provides useful capabilities for extracting meaning from multilingual content in a simple and efficient way. Combining API calls with open source libraries in your favorite programming language is so easy and powerful at the same time that will awaken for sure the Political Data Scientist hidden inside of you. Be warned!

Our research objective is to analyze mentions to people, places, or entities in general in the Politics section of different news media. We will try to carry out an analysis that can answer the following questions:

  • Which are the most popular names?
  • Does their popularity depend on the political orientation of the newspaper?
  • Is it correlated somehow to the popularity surveys or voting intentions polls?
  • Do these trends change over time?

Before we begin

This is a technical tutorial in which we will develop some coding. However, we will try to guide you through the whole process, so everyone can follow the explanations and understand the purpose of the tutorial.

For the sake of generality and better understanding, we will focus on U.S. Politics in English, but obviously you can easily adapt the same analysis for your own country or (MeaningCloud supported) language.

And last but not least, this tutorial will use PHP as programming language for the code examples. However, any non-rookie programmer should be able to translate the scripts into any language of their choice.

Continue reading


Semantic Publishing: a Case Study for the Media Industry

Semantic Publishing at Unidad Editorial: a Client Case Study in the Media Industry 

Last year, the Spanish media group Unidad Editorial deployed a new CMS developed in-house for its integrated newsroom. Unidad Editorial is a subsidiary of the Italian RCS MediaGroup, and publishes some of the newspapers and magazines with highest circulation in Spain, besides owning nation-wide radio stations and a license of DTTV incorporating four TV channels.

Newsroom El Mundo

Newsroom El Mundo

When a journalist adds a piece of news to the system, its content has to be tagged, which constitutes one of the first steps in a workflow that will end with the delivery of this item in different formats, through different channels (print, web, tablet and mobile apps) and for different mastheads. After evaluation of different provider’s solutions in the previous months, the company then decided that semantic tagging would be done through Daedalus’ text analytics technology. Semantic publishing included, in this case, the identification (with disambiguation) of named entities (people, places, organizations, etc.), time and money expressions, concepts, classification according to the IPTC scheme (an international standard for the media industry, with around 1400 classes organized in three levels), sentiment analysis, etc.

Continue reading


Our new Semantic Publishing API is now available in MeaningCloud

This API allows you to produce and publish more valuable contents, more quickly and at lower costs

UPDATE: this API has been discontinued. Use instead our Solution for Semantic Publishing, featuring APIs like Topics Extraction, Text Classification and Automatic Summarization.

At MeaningCloud we keep developing our roadmap and offering new vertical APIs, optimized for different industries and applications. We are pleased to announce that our Semantic Publishing solutions include a new API, designed especially for media, publishers and content providers in general.

It is a logical step for us, since at MeaningCloud we have been collaborating for years with the most significant enterprises in these industries (PRISA, Unidad Editorial, Vocento, RTVE, lainformacion.com, etc.) and this is one of the markets where we are detecting more demand and where our solutions are gaining more traction.

The Semantic Publishing API incorporates the know-how we have been developing when working with these large companies and packages it in the form of semantic resources, process pipelines and specific configurations for the most common applications and scenarios of this industry: archive management, content generation, customization of information products, etc.

Continue reading


MeaningCloud – Meaning as a Service: our new SaaS offering

A few weeks ago we were talking on here about how Daedalus (now called MeaningCloud) has explored various API-based business models for marketing its semantic technologies. Our perception was that basic language features are hard to use for many developers —not experts in these technologies— and require them to build solutions through a process of trial and error, in a do-it-yourself fashion, which is slow and inefficient.

Our vision was to offer the market APIs with a more plug-and-play philosophy, which provide a functionality closer to the business, a faster learning curve and, as a result, an increased productivity. That vision has crystallized in our product Textalytics and today we can say that it is now available.

Textalytics is the easiest way to embed semantics into your applications

Textalytics (now called “MeaningCloud”) is a SaaS offering that provides a high-level multilingual semantic processing functionality to those developers / integrators who want to develop semantic solutions in an effective, quick and cheap way. Compared with other semantic offerings in service mode, MeaningCloud offers several APIs, each with a specific functionality which is close to an application domain, and SDKs, plug-ins, etc. that make its learning and use much easier, reducing the effort required to obtain results and time-to-market.

Textalytics - Meaning as a Service

Continue reading