<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
 <channel>
  <atom:link href="http://www.comsol.com/community/forums/general/rss/thread/28813.rss" rel="self" type="application/rss+xml"/>
  <title>COMSOL Forums: No speedup when using distributed memory cluster</title>
  <link>http://www.comsol.com/community/forums/general/thread/28813/</link>
  <description>Most recent forum messages</description>
  <pubDate>Thu, 04 Oct 2012 20:50:57 +0000</pubDate>
  <image>
   <title>COMSOL Forums: No speedup when using distributed memory cluster</title>
   <url>http://www.comsol.com/shared/images/logos/comsol_logo.gif</url>
   <link>http://www.comsol.com/community/forums/general/thread/28813/</link>
  </image>
  <item>
   <title>Re: No speedup when using distributed memory cluster</title>
   <link>http://www.comsol.com/community/forums/general/thread/28813/#p87439</link>
   <description>Dear Josh,&lt;br /&gt;&#13;
&lt;br /&gt;&#13;
unfortunately I d not read your post earlier: hope it's not too late!&lt;br /&gt;&#13;
The models that you have tested are provided to test proper operations of the cluster, not performance. For performance, you'd better try to reproduce the results of the following paper, based upon models also available in the model library, for the same number of DOFs: &lt;a href=&quot;http://www.comsol.fr/papers/10248/&quot; title=&quot;www.comsol.fr/papers/10248/&quot;&gt;www.comsol.fr/papers/10248/&lt;/a&gt;&lt;br /&gt;&#13;
&lt;br /&gt;&#13;
Best regards,&lt;br /&gt;&#13;
Stephan&lt;br /&gt;&#13;
&lt;br /&gt;&#13;
--&lt;br /&gt;&#13;
&lt;a href=&quot;http://www.comsol.fr&quot; title=&quot;comsol.fr&quot;&gt;www.comsol.fr&lt;/a&gt;</description>
   <pubDate>Thu, 04 Oct 2012 20:50:57 +0000</pubDate>
   <guid isPermaLink="false">28813.1349383857.87439</guid>
  </item>
  <item>
   <title>No speedup when using distributed memory cluster</title>
   <link>http://www.comsol.com/community/forums/general/thread/28813/#p78429</link>
   <description> I wanted to see if someone could help me determine why I am not seeing any speedup when I submit jobs on multiple nodes of my HPC cluster (using the distributed memory capability).  &lt;br /&gt;&#13;
&lt;br /&gt;&#13;
I understand that speedup is highly model dependent.  Per the suggestions of previous discussion threads, I have tried numerous different models with different physics (both linear and non-linear problems).  Also, I have tried large memory models and small memory models (also per previous thread recommendations).  Below are my results using the COMSOL Model Library Example &amp;quot;Micromixer Cluster Version&amp;quot;:&lt;br /&gt;&#13;
 &lt;br /&gt;&#13;
Micromixer_cluster.mph (mesh as given):&lt;br /&gt;&#13;
1 node; 1 proc.: Run time = 123 sec&lt;br /&gt;&#13;
1 node; 12 proc.: Run time = 38 sec&lt;br /&gt;&#13;
4 nodes; 12 ppn: Run time = 99 sec&lt;br /&gt;&#13;
8 nodes; 12 ppn: Run time = 223 sec&lt;br /&gt;&#13;
 &lt;br /&gt;&#13;
I see speedup when I go from 1 proc. to 12 proc. running on 1 node (shared memory), but I don't see speedup, in fact I see slowdown, when I try to distribute the job across multiple nodes (distributed memory).  The step-by-step instructions said to try refining the mesh for better speedup (this was also COMSOL support's recommendation).  Here are the results for 2 different refined meshes:&lt;br /&gt;&#13;
 &lt;br /&gt;&#13;
Micromixer_cluster.mph (refined mesh):&lt;br /&gt;&#13;
1 node; 1 proc.: Run time = 566 sec&lt;br /&gt;&#13;
1 node; 12 proc.: Run time = 130 sec&lt;br /&gt;&#13;
4 nodes; 12 ppn: Run time = 259 sec&lt;br /&gt;&#13;
8 nodes; 12 ppn: Run time = 501 sec&lt;br /&gt;&#13;
 &lt;br /&gt;&#13;
Micromixer_cluster.mph (super-refined mesh):&lt;br /&gt;&#13;
1 node; 1 proc.: Run time = 1169 sec&lt;br /&gt;&#13;
1 node; 12 proc.: Run time = 414 sec&lt;br /&gt;&#13;
4 nodes; 12 ppn: Run time = 614 sec&lt;br /&gt;&#13;
8 nodes; 12 ppn: Run time = 896 sec&lt;br /&gt;&#13;
&lt;br /&gt;&#13;
Still no speedup.  Only slowdown.&lt;br /&gt;&#13;
&lt;br /&gt;&#13;
Has anyone seen any speedup on this COMSOL Model Library example?  If so, I'd be interested in your results.  &lt;br /&gt;&#13;
&lt;br /&gt;&#13;
One thing that I am doing that is different than the COMSOL recommendation is submitting jobs through the command line rather than through the Desktop.  Does anyone know why COMSOL recommends submitting batch cluster jobs through the Desktop and not through the command line?  Could this be my issue?&lt;br /&gt;&#13;
&lt;br /&gt;&#13;
Any help would be appreciated.&lt;br /&gt;&#13;
</description>
   <pubDate>Wed, 23 May 2012 17:54:43 +0000</pubDate>
   <guid isPermaLink="false">28813.1337795683.78429</guid>
  </item>
 </channel>
</rss>
