Lars Eggert – XMPP Deployment Trends

Global XMPP Deployment

10.2%

500 sites tested
0 DNS errors
51 with XMPP

China

2.0%

500 sites tested
0 DNS errors
10 with XMPP

Germany

8.2%

500 sites tested
1 DNS error
41 with XMPP

Finland

7.6%

500 sites tested
0 DNS errors
38 with XMPP

India

8.8%

499 sites tested
1 DNS error
44 with XMPP

IETF

12.7%

2764 sites tested
198 DNS errors
325 with XMPP

South Korea

5.0%

500 sites tested
0 DNS errors
25 with XMPP

United Kingdom

8.8%

500 sites tested
0 DNS errors
44 with XMPP

United States

11.6%

500 sites tested
1 DNS error
58 with XMPP

What do these numbers mean?

This experiment attempts to answer the following question: If an average user had a working installation of XMPP on their machine, how useful would it be to them? What percentage of the services and sites the average user regularly accesses are XMPP-enabled? In other words, the experiment attempts to quantify the usefulness of XMPP to the average end user, given the current deployment of XMPP in the Internet.

The experiment does not track how many users or hosts use XMPP in the current Internet. It also does not track how many sites have configurations of XMPP that are not accessible by average users from the Internet.

The Extensible Messaging and Presence Protocol (XMPP) is an open XML technology for real-time communication, which powers a wide range of applications including instant messaging, presence, media negotiation, whiteboarding, collaboration, lightweight middleware, content syndication, and generalized XML routing.

The IETF statistics are based on a list of domain names that are derived from the email addresses of currently-active document authors of Internet Engineering Task Force (IETF) documents. This data set was included to investigate if the organizations that IETF authors come from are more progressive in deploying XMPP, compared to the rest of the Internet.

How are these numbers generated?

The scripts that update this page retrieve the names of the web sites that are most popular across the globe, as well as in select countries, from alexa.com in regular intervals. They then check whether the DNS entry for each site name reflects that it uses XMPP. The numbers above show the percentage of these top sites that are XMPP-enabled, as well as the absolute numbers.

Note that although the DNS entry for a site may indicate that XMPP is available, this does not necessarily mean that actually using XMPP with the site will succeed. I’ll eventually add code to verify that XMPP can be used with sites that claim to enable it.

How representative are these numbers?

They’re reasonably representative, but not perfect. One issue is that the sample sets are very small; alexa.com typically offers lists of 100 to 500 top sites for free, depending on the country. More importantly, though, the sample sets are derived from web site names, because that’s all alexa.com offers. It is not clear that checking XMPP deployment based on a set of web site names is resulting in numbers that represent deployment of XMPP in the broader Internet.

Attention, operators: I’m interested in basing these statistics on a more meaningful data set. If you can provide me with a regularly-updated list of most-frequently-looked-up DNS names – or, for SPF or DKIM, a list of the domains that generate the most inbound email – please contact me at please enable javascript to view . The larger your network and the longer the list, the better.

How have these numbers been changing over time?

Funny you should ask. The graphs below (click on each image to get a PDF that lets you zoom in) illustrate the weekly changes of XMPP deployment in the various sample sets since these measurements started in October 2007:

XMPP deployment trends - click to zoom in (PDF)

This graph shows the same data as the one above, but zooms in on the interesting area:

XMPP deployment trends (magnified) - click to zoom in (PDF)

Significant jumps in the historic data (e.g., fall 2008 or spring 2009) are usually due to alexa.com changing what data they make available, or on tracking bugs having been fixed. The latter fixes are often based on suggestions of visitors to this page. See the acknowledgements below.

Download deployment trends as text: global cn de fi in jp kr uk us ietf

Acknowledgements and Changes

  • The original idea for these statistics came out of discussions on an “IPv6 clock” in Joe Touch’s group of PhD students at USC/ISI around 1999 – we just never got around to implementing it.
  • Thanks to Jari Arkko for the affiliation information of IETF authors, obtained from his author statistics .
  • Miguel Garcia explained how to track SIP deployment.
  • Marcus Isomäki suggested to track XMPP deployment.
  • Jim Fenton pointed out a critical bug in my DKIM tracking code.
  • Rickard Bondesson pointed out a critical bug in my DKIM tracking code in September 2008.
  • Frank Ellermann suggested to track SPF deployment.
  • Eric Vyncke suggested to check some subdomains commonly used for IPv6 in October 2008.
  • RK suggested to follow SPF redirects in March 2009.
  • In April 2009, alexa.com made 500 domain names per country available, which affected the results.
  • Iljitsch van Beijnum pointed out a bug that affected the sort order of the detailed results in March 2010.
  • Added more IPv6 subdomains in June 2010.