Skip to content

3. Content syndication

Objectives
  • Analyse a specific example of the use of markup language for the transmission of information.
  • Define an XML document based on its syntax and structure.
Assessment criteria
  • The advantages that content syndication brings to the management and transmission of information have been identified.
  • Its areas of application have been defined.
  • Technologies based on content syndication have been analysed.
  • The structure and syntax of a content channel has been identified.
  • Content channels have been created and validated.
  • Functionality and access to channels are checked.
  • Specific tools are used such as aggregators and channel directories.

Introduction

The syndication (or broadcasting) of web content is that a part of the content of a web page is made available to other sites or individual subscribers through a web channel, the most common used format is RSS, followed by the Atom. Computer programs that comply with any of these standards they periodically consult a file with several entries linking to the full or partial articles on the original website. Unlike other media, web content broadcast rights are usually free, and do not usually intervene a contract between the parties but a licence rules.

Basic content syndication scheme

A little history

As a web user, it is very important to get links from websites that interest you and store them. In such a way that you are able to get back to them quickly. If you wanted to follow the changes on the web pages the only way to do that it was to visit the website to check for news.

The advent of what became known as Web 2.0 complicated things. The Web was filled with a large number of blogs and pages that published information, and visiting them all to see if there were any changes took a long time, and, in addition, it is very likely that pages have not changed.

The advent of standard syndication systems made possible to obtain information updates a website in a stable way through an specific address. Content syndication changed the way the content is retrieved. You no longer had to look for information: it was the information that went to the user.

By using syndication, user no longer needs to visit the pages that interest him to see if there are any changes, because if there are any you will already receive them. This leads to savings of time, as you will not have to visit pages to discover that there are no changes.

Another advantage that syndication brings is inherent in XML. Unlike of what happens with HTML, it is easy to interpret the content of the information that is received and therefore it will also be easy to be able to reuse the content to make others tasks.

Although syndication is often seen as a system focused on detecting updates on the Web, it is also being used to keep updates in other fields. For example, some computer programs use RSS to find out if there are new versions or updates and thus keep the programs up to date.

Content syndication today

At present, with the emergence of social networks and other technologies, the use of content syndication with standard formats has decreased and the content broadcasting has focused on posting on social media and the use of other sharing systems such as APIs.

However, in the world of podcast, it continues to be used as an automatic way to share content.

podcast

A podcast is a radio, audio, or video file intended for podcasting.

Podcasting refers to the technique for creating and broadcasting audio and video files over the Internet that allows the user to download them to their computer and listen to them or see them whenever you want.

Example of use

Event 24h24l.org

Using a podcatcher we can download the new podcasts published.

Adding the feed (name given to the list of items) shows us what's new that there is.

gPodder usage example

Once added, we are subscribed and the podcatcher will notify us of news.

gPodder usage example 2

Structure of content channels

Web feeds allow programs to check if there are updates posted on a website. To provide a web channel, the site owner may use specialized software (such as a content management system) that publishes a list (or "feed") of articles or recent content in a standardized automatic reading format.

The channel can be downloaded through programs that use it, such as websites that distribute feed content, or through feed readers called aggregators that allow Internet users to subscribe to channels and view their content.

A channel (feed) contains entries, which can be headlines, full-text articles, extracts, summaries, or links to content from a website along with various metadata.

Next we will analyse the characteristics of the two standards, based on XML, for creating content channels: RSS 2.0 and Atom 1.0.

RSS 2.0

RSS (RDF Site Summary or Really Simple Syndication) is a web feed (or feed) that allows users and applications access website updates in a format standardized and computer readable.

It was developed by the RSS Advisory Board and its first version, RSS 0.90, appeared in March 1999. The latest version, 2.0, was released in March 2009.

Info

A standard XML file format ensures compatibility with many different machines / programs. RSS feeds also benefit users who want to receive timely updates from favorite websites or add data from many sites.

RSS is an XML dialect. The RSS format itself is relatively easy to read for both automated and human processes. An example feed can have content as follows:

<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0">
<channel>
 <title>RSS Title</title>
 <description> This is an example of an RSS feed</description>
 <link>http://www.example.com/main.html</link>
 <copyright>2020 Example.com All rights reserved</copyright>
 <lastBuildDate>Mon, 06 Sep 2010 00:01:00 +0000</lastBuildDate>
 <pubDate>Sun, 06 Sep 2009 16:20:00 +0000</pubDate>
 <ttl>1800</ttl>

 <item>
  <title>Example entry</title>
  <description>Here is some text containing an interesting 
    description.</description>
  <link> http://www.example.com/blog/post/1</link>
  <guid isPermaLink = "false">7bd204c6-1655-4c27-aeee-53f933c5395f</guid>
  <pubDate>Sun, 06 Sep 2009 16:20:00 +0000</pubDate>
 </item>
</channel>
</rss>

RSS feed tree diagram RSS feed tree diagram

At the top level, an RSS document is a <rss> element, with an attribute required called version, which specifies the RSS version it meets the document.

Subordinate to the <rss> element is a <channel> element that contains information about the channel (metadata) and its content in <item> elements.

RSS 2.0 Specification

For more details you can access the specification: RSS 2.0 Specification

Despite the fact that there is not an official RSS schema, we can find some implementations like this: RSS 2.0.

Atom

The atom syndication format is an XML dialect used for web feeds.

The Atom format was developed as an alternative to RSS. Ben Trott, defender of the new format that became Atom, believed that RSS had limitations and flaws, such as the lack of continuous innovation and its need to remain compatible, and that there were advantages in a new design.

A document based on the Atom format has the following structure:

<?xml version="1.0" encoding = "utf-8"?>

<feed xmlns = "http://www.w3.org/2005/Atom">
  <title> Example Feed </title>
  <subtitle> A subtitle. </subtitle>
  <link href = "http://example.org/feed/" rel = "self" />
  <link href = "http://example.org/" />
  <id> urn: uuid: 60a76c80-d399-11d9-b91C-0003939e0af6 </id>
  <updated> 2003-12-13T18: 30: 02Z </updated>


<entry>
  <title> Atom-Powered Robots Run Amok </title>
  <link href = "http://example.org/2003/12/13/atom03" />
  <link rel = "alternate" type = "text / html"
    href = "http://example.org/2003/12/13/atom03.html" />
  <link rel = "edit" href = "http://example.org/2003/12/13/atom03/edit" />
  <id> urn: uuid: 1225c695-cfb8-4ebb-aaaa-80da344efa6a </id>
  <updated> 2003-12-13T18: 30: 02Z </updated>
  <summary> Some text. </summary>
    <content type = "xhtml">
      <div xmlns = "http://www.w3.org/1999/xhtml">
        <p> This is the entry content. </p>
      </div>
    </content>
    <author>
      <name> John Doe </name>
      <email> johndoe@example.com </email>
    </author>
</entry>

</feed>

For more details: RFC 4287 - The Atom Syndication Format

Validation

Since both RSS and Atom are XML documents, you will be able to check that they are correct using the same verification tools that are used in XML.

Although it is possible to use XML validators, the most common is to do so serve specific programs to validate RSS and Atom such as the W3C Feed Validation Service (validator.w3.org/feed)

Tasks

  1. Using a RSS aggregator

    In the next task we will install an RSS aggregator and subscribe to some channels.

    1. Install the Firefox add-on: Feedbro.
    2. Access the add-on.
    3. Subscribe to the following channels:
      1. The podcasts of the 24h24l.org event.
      2. The latest news from El País
      3. The front page news of El Mundo.
      4. Check if any of the pages you consult regularly redistribute their contents. If so, subscribe to their channel.
  2. 302rssfeed.xml. Creating an RSS feed

    You were the creator of the web recipe-example.org and you currently have in the main page the following content.

    <!DOCTYPE html>
    <html lang = "ca">
    <head>
      <meta charset = "UTF-8">
      <meta name = "viewport" content = "width = device-width, initial-scale = 1.0">
      <title>Recipes</title>
    </head>
    
    <body>
    <header>
      <h1>Sample recipes </h1>
    </header>
    <main>
    <article>
      <h2>Baked rice</h2>
      <time datetime = "2020-12-13 08:00:00"> Sunday, December 13
        2020 </time>
      <p> The peculiarity of this rice, as the name suggests, is
        which is baked. I,
        like paella and other Valencian rice dishes, it is also a
        dish of popular origin, which in this case was made from
        of the remains of the putxero. That's why among his
        Ingredients are not missing chickpeas, pork chops and
        sausage. This dish is especially typical in counties such as
        la Costera, where it has been celebrated in Xàtiva for a long time
        a few years the National Baked Rice Contest. </p>
      <p> <a href="/2020/12/arros-al-forn.html"> Continue reading </a> </p>
    </article>
    <article>
      <h2>Paella valenciana</h2>
      <time datetime="2020-04-06 13:20:00"> Monday, April 6,
       2020 </time>
      <p> The paella is the highest standard of Valencian cuisine and
      probably also of the Spanish, due to its recognition
      gastronomy worldwide. Its origin, like that of all
      dishes of popular cuisine, derived from the conjunction of foods that
      each family had at their disposal, especially in the area of
      the orchard of Valencia, which was supplied with fresh vegetables.
      In addition, it was formerly customary to raise chickens and rabbits for
      to family use, therefore, if we add abundance to all this
      of the rice cultivated in the Albufera, the result is this
      genuinely Valencian dish that receives the name of the container in the
      which is cooked. </p>
      <p><a href="/2020/04/paella-valenciana.html"> Continue reading </a> </p>
    </article>
    <article>
      <h2>Olleta</h2>
      <time datetime = "2019-12-28 18:20:00"> Friday, December 28
      2019 </time>
      <p> L'olleta is the most representative dish in the mountain area
      of Alicante and in the interior of Valencia. This is a succulent putxero,
      similar to a broth stew but one of the most refined in all of Spain,
      and for this reason it is recognized nationally. Its intense flavor
      and fragrant derives from the aromatic fragrance of the sausages and,
      once degreased, it reaches a sublime point. </p>
      <p> <a href="/2019/12/olleta.html"> Continue reading </a> </p>
    </article>
    </main>
      <footer>
      </footer>
      </body>
    </html>
    
    Create the RSS feed in version 2.0 so that users know when you post new recipes.

    Validate the file.

  3. 303Ivoox. Given the following IVOOX screenshot create by hand in a sheet an RSS feed that announces the three podcast episodes.

Credits, bibliography and webography