<?xml version="1.0" encoding="UTF-8"?><!-- generator="wordpress/2.0.4" -->
<rss version="2.0" 
	xmlns:content="http://purl.org/rss/1.0/modules/content/">
<channel>
	<title>Comments on: Massively Scalable&#8230;</title>
	<link>http://blog.jabber.com/filaments/2006/11/14/massively-scalable/</link>
	<description>Jabber Filaments Blog</description>
	<pubDate>Mon, 08 Sep 2008 06:16:29 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.0.4</generator>

	<item>
		<title>by: wajid</title>
		<link>http://blog.jabber.com/filaments/2006/11/14/massively-scalable/#comment-16960</link>
		<pubDate>Sun, 23 Sep 2007 17:14:22 +0000</pubDate>
		<guid>http://blog.jabber.com/filaments/2006/11/14/massively-scalable/#comment-16960</guid>
					<description>oovoo.com</description>
		<content:encoded><![CDATA[<p>oovoo.com
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: MarkM</title>
		<link>http://blog.jabber.com/filaments/2006/11/14/massively-scalable/#comment-8576</link>
		<pubDate>Fri, 18 May 2007 06:32:44 +0000</pubDate>
		<guid>http://blog.jabber.com/filaments/2006/11/14/massively-scalable/#comment-8576</guid>
					<description>Do you have any metrics on XCP (5.2 or earlier) running on a Linux Server vs. the Solaris 10 configuration that you presented here?

Craig Kaes replies:
&lt;em&gt;XCP is a highly configurable platform so despite a number of large load tests we've done on Linux, none would serve as a direct comparison to the 1M user Solaris test.  The closest one to that configuration that we did was a 200k user test spread over five XCP routers, each running on a dual proc xeon 3.6Ghz machine.  Additionally we used 16 connection manager processes spread of eight similar machines.  The two largest differences between this test and the Solaris test were 1) PostgreSQL 8.1 (Linux) vs. Oracle 10 (Solaris) and 2) the Linux test was run before a number of configuration and code optimizations were identified and implemented.  In the Linux test, the database was definitely the limiting factor where in the Solaris test, we were only using about 1/10 of the resources on the database server.

It would certainly be interesting to run a large scale Linux test similar to what we've already done on Solaris -- especially with the optimizations in and with the knowledge we've gained.
&lt;/em&gt;</description>
		<content:encoded><![CDATA[<p>Do you have any metrics on XCP (5.2 or earlier) running on a Linux Server vs. the Solaris 10 configuration that you presented here?</p>
<p>Craig Kaes replies:<br />
<em>XCP is a highly configurable platform so despite a number of large load tests we&#8217;ve done on Linux, none would serve as a direct comparison to the 1M user Solaris test.  The closest one to that configuration that we did was a 200k user test spread over five XCP routers, each running on a dual proc xeon 3.6Ghz machine.  Additionally we used 16 connection manager processes spread of eight similar machines.  The two largest differences between this test and the Solaris test were 1) PostgreSQL 8.1 (Linux) vs. Oracle 10 (Solaris) and 2) the Linux test was run before a number of configuration and code optimizations were identified and implemented.  In the Linux test, the database was definitely the limiting factor where in the Solaris test, we were only using about 1/10 of the resources on the database server.</p>
<p>It would certainly be interesting to run a large scale Linux test similar to what we&#8217;ve already done on Solaris &#8212; especially with the optimizations in and with the knowledge we&#8217;ve gained.<br />
</em>
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: Baffe</title>
		<link>http://blog.jabber.com/filaments/2006/11/14/massively-scalable/#comment-21</link>
		<pubDate>Fri, 17 Nov 2006 16:10:58 +0000</pubDate>
		<guid>http://blog.jabber.com/filaments/2006/11/14/massively-scalable/#comment-21</guid>
					<description>&lt;blockquote&gt;What amazed even us was that as the scale of the test increased, the efficiency of Jabber XCP also increased while CPU utilization decreased significantly.&lt;/blockquote&gt;
Do you mean exactly what you write or are there some "per user" missing?

--------

&lt;cite&gt;&lt;a rel="external nofollow" href="http://www.jabber.com/" rel="nofollow"&gt;Craig Kaes&lt;/a&gt;&lt;/cite&gt; Says:

&lt;a rel="nofollow" href="http://blog.jabber.com/2006/11/14/massively-scalable/#comment-23" rel="nofollow"&gt;November 17, 2006 at 4:06 pm&lt;/a&gt;
Yeah -- cpu decreased per box -- not overall. This decrease was  linear which, together with the linear addition of the usersa shows overall  linear scalability and also shows that we were too conservative in the number of  users that we added as we added additional hardware</description>
		<content:encoded><![CDATA[<blockquote><p>What amazed even us was that as the scale of the test increased, the efficiency of Jabber XCP also increased while CPU utilization decreased significantly.</p></blockquote>
<p>Do you mean exactly what you write or are there some &#8220;per user&#8221; missing?</p>
<p>&#8212;&#8212;&#8211;</p>
<p><cite><a rel="external nofollow" href="http://www.jabber.com/" rel="nofollow">Craig Kaes</a></cite> Says:</p>
<p><a rel="nofollow" href="http://blog.jabber.com/2006/11/14/massively-scalable/#comment-23" rel="nofollow">November 17, 2006 at 4:06 pm</a><br />
Yeah &#8212; cpu decreased per box &#8212; not overall. This decrease was  linear which, together with the linear addition of the usersa shows overall  linear scalability and also shows that we were too conservative in the number of  users that we added as we added additional hardware
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: Geekwad</title>
		<link>http://blog.jabber.com/filaments/2006/11/14/massively-scalable/#comment-15</link>
		<pubDate>Thu, 16 Nov 2006 20:00:26 +0000</pubDate>
		<guid>http://blog.jabber.com/filaments/2006/11/14/massively-scalable/#comment-15</guid>
					<description>"What amazed even us was that as the scale of the test increased, the efficiency of Jabber XCP also increased while CPU utilization decreased significantly."  Umm...  what?  Am I reading that right?  You've discovered an inexhaustible source of CPU cycles?

-------- &lt;cite&gt;&lt;a rel="external nofollow" href="http://www.jabber.com/" / rel="nofollow"&gt;&lt;/cite&gt;

&lt;cite&gt;&lt;a rel="external nofollow" href="http://www.jabber.com/" rel="nofollow"&gt;Craig Kaes&lt;/a&gt;&lt;/cite&gt; Says: 						&lt;a href="http://blog.jabber.com/2006/11/14/massively-scalable/#comment-23" / rel="nofollow"&gt;

&lt;a href="http://blog.jabber.com/2006/11/14/massively-scalable/#comment-23" rel="nofollow"&gt;November 17, 2006 at 4:06 pm&lt;/a&gt;
To clarify - As we increased the number of processors, CPU utilization per box decreased, not overall CPU utilization. What this showed was that we could have added more users with each box that we added and that the growth was indeed linear. It would be absurd to assert exponential scalability &lt;img class="wp-smiley" alt=";-)" src="http://blog.jabber.com/wp-includes/images/smilies/icon_wink.gif" /&gt;</description>
		<content:encoded><![CDATA[<p>&#8220;What amazed even us was that as the scale of the test increased, the efficiency of Jabber XCP also increased while CPU utilization decreased significantly.&#8221;  Umm&#8230;  what?  Am I reading that right?  You&#8217;ve discovered an inexhaustible source of CPU cycles?</p>
<p>&#8212;&#8212;&#8211; <cite><a rel="external nofollow" href="http://www.jabber.com/" / rel="nofollow"></cite></p>
<p><cite><a rel="external nofollow" href="http://www.jabber.com/" rel="nofollow">Craig Kaes</a></cite> Says: 						<a href="http://blog.jabber.com/2006/11/14/massively-scalable/#comment-23" / rel="nofollow"></p>
<p><a href="http://blog.jabber.com/2006/11/14/massively-scalable/#comment-23" rel="nofollow">November 17, 2006 at 4:06 pm</a><br />
To clarify - As we increased the number of processors, CPU utilization per box decreased, not overall CPU utilization. What this showed was that we could have added more users with each box that we added and that the growth was indeed linear. It would be absurd to assert exponential scalability <img class="wp-smiley" alt=";-)" src="http://blog.jabber.com/wp-includes/images/smilies/icon_wink.gif" />
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: alex9512</title>
		<link>http://blog.jabber.com/filaments/2006/11/14/massively-scalable/#comment-13</link>
		<pubDate>Thu, 16 Nov 2006 16:56:26 +0000</pubDate>
		<guid>http://blog.jabber.com/filaments/2006/11/14/massively-scalable/#comment-13</guid>
					<description>What was the specific hardware configuration, and how much, and what kind of tuning did you do with the operating system? (I assume solaris 10)

----------

&lt;cite&gt;&lt;a rel="external nofollow" href="http://www.jabber.com/" rel="nofollow"&gt;Craig Kaes&lt;/a&gt;&lt;/cite&gt; Says:
November 17, 2006 at 11:31 am

We used 2 4-core T2000s for our connection managers and 6 8-core T2000s for our routers. Because of the nature of the baseline and the constraints of the test, we had 3 of the 8 cores disabled on each of the
6 router boxes. It would be a delight to see what we could do if we used all 8 cores for those tests.

As for system tuning, we got access to a beta tool from Sun called CoolTuner (&lt;a rel="nofollow" target="_blank" href="http://www.sun.com/download/products.xml?id=4501bae1" rel="nofollow"&gt;http://www.sun.com/download/products.xml?id=4501bae1&lt;/a&gt;). It added the following to the system file, but I’m not sure that any of it was actually needed. We were using Solaris 10 and the OS really never got in our way.

set ip:ip_squeue_bind=0
set ip:ip_squeue_fanout=1
set ipge:ipge_tx_syncq=1
set ipge:ipge_srv_fifo_depth=16000
set ipge:ipge_bcopy_thresh=512
set ipge:ipge_dvma_thresh=1
set pcie:pcie_aer_ce_mask=0×1 # Not necessary in T2000+ shipping June
2006
set rlim_fd_cur=260000
set rlim_fd_max=260000
set maxphys=1048576
set md:md_maxphys=1048576</description>
		<content:encoded><![CDATA[<p>What was the specific hardware configuration, and how much, and what kind of tuning did you do with the operating system? (I assume solaris 10)</p>
<p>&#8212;&#8212;&#8212;-</p>
<p><cite><a rel="external nofollow" href="http://www.jabber.com/" rel="nofollow">Craig Kaes</a></cite> Says:<br />
November 17, 2006 at 11:31 am</p>
<p>We used 2 4-core T2000s for our connection managers and 6 8-core T2000s for our routers. Because of the nature of the baseline and the constraints of the test, we had 3 of the 8 cores disabled on each of the<br />
6 router boxes. It would be a delight to see what we could do if we used all 8 cores for those tests.</p>
<p>As for system tuning, we got access to a beta tool from Sun called CoolTuner (<a rel="nofollow" target="_blank" href="http://www.sun.com/download/products.xml?id=4501bae1" rel="nofollow">http://www.sun.com/download/products.xml?id=4501bae1</a>). It added the following to the system file, but I’m not sure that any of it was actually needed. We were using Solaris 10 and the OS really never got in our way.</p>
<p>set ip:ip_squeue_bind=0<br />
set ip:ip_squeue_fanout=1<br />
set ipge:ipge_tx_syncq=1<br />
set ipge:ipge_srv_fifo_depth=16000<br />
set ipge:ipge_bcopy_thresh=512<br />
set ipge:ipge_dvma_thresh=1<br />
set pcie:pcie_aer_ce_mask=0×1 # Not necessary in T2000+ shipping June<br />
2006<br />
set rlim_fd_cur=260000<br />
set rlim_fd_max=260000<br />
set maxphys=1048576<br />
set md:md_maxphys=1048576
</p>
]]></content:encoded>
				</item>
</channel>
</rss>
