Monday, June 1, 2009

Wednesday, April 8, 2009

Platform LSF Desktop Support Version 7 Update 2 - Platform LSF Desktop Support Administrator\xd5 s Guide - Introducing Platform LSF Desktop Support

Platform LSF Desktop Support Version 7 Update 2 - Platform LSF Desktop Support Administrator\xd5 s Guide - Introducing Platform LSF Desktop Support

lsf installation guide

http://ls11-www.informatik.uni-dortmund.de/people/hermes/manuals/LSF/install.pdf

lsf licensing

The FLEXlm license key and license server are independent of LSF clusters and LSF server hosts. You can organize your hosts into clusters any way you choose. The license server counts only the total number of hosts running LSF; these hosts can belong to any cluster in your network. As long as all your hosts can contact each other on the network, you should request a single license key that covers all the hosts on which you plan to run LSF.


FLEXlm normally runs the license server daemon on one host. LSF tolerates failure of the license server daemon for up to 60 hours, as long as the LSF Load Information Manager (LIM) on the master host is not restarted or shutdown. In spite of this, if you are concerned about reliability, you can run the license server daemon on three hosts. Software licenses are available as long as two of the three license servers are available


You should run the license server on the host that is the NFS server for the LSF software. That way the licenses are available whenever the software is. However, it is not recommended that you make the license server host the same as the master host for the cluster. If you do this, and the master host goes down, the backup master that takes over will not be able to check license tokens out from the license server on the original master, which has failed.

Sunday, March 29, 2009

tidbits

3d cfd model has 5-10m cells.
each cell can solve 7 equations per iteration
cfd solvers are iterative solvers, needs 3-4k iterations
one CPU can handle up to 0.5m cells

parallel computing - which is better: multicore, smp (large shared memory for structual/thermal apps), cluster (massively parallel mpp; high speed connected hosts for cfd apps) or grid (slow speed connected hosts)


fluent 4gb/core
16 cores: 2 serial + 14 parallel
4 cores: 1 serial + 3 parallel

cfx 4gb/core
8 cores: 1 serial + 7 parallel

ls-dyna 4gb/core
4 cores:

abacus 4gb/core
4 cores

ansys 4gb/core
8 cores

mscnastran/nxnastran 4gb/core
4 cores

suggested config

cpu: 4 nodes x 2 cpu per node x 4 cores per cpu
ram: 4 gb per core = 16 gb per cpu = 32 gb per node

costing:
nodes: USD 33500 for 4 nodes, (is it inclusive of rack, cabling etc)
infiniband: USB 14500 for 4 nodes
quadrics: na
storage: lustre, san,
mgmt: scheduler, ssi, failover job restart, ganglia, nagios, big brother
scheduler: lsf USD 15000
support: next business day included, 24*7 4-hour USD 400

considerations: application's system requirements, sys admin hassle (least possible units, user mgmt : active directory, ldap), end user ease of use, cost effective/long shelf life/sustained ROI, training/support from vendors,

cost to consider: ramp up time, licensing cost for different hw architectures

scalability: most os allow 32 to 64 partitions in one partition, for openmp limit is 8 to 16 processes.
benchmarking: pallas

what about license server? flexlm
do we have some benchmark results?

virtual lsf master with failover to smp host
common lsf config dir [on nas]
wall clock queues (short, medium, unlimited)
app starts [wrappers to lsf]
multiple versions of apps
ad integration
licensing using lsf resources [existing flexlm license server]
elim dynamic resource update based on actual license server usage
move job data to local stratch at dispatch
user machines are standard win32/64


todo
hardware recommendations
parallel file system design
test server/dev env hardware bom
implementation support plan
handoff
move to preppost licenses [to stnadardize/separate out solver licenses][provides a means to keep working while solve is running]
add mechhps licenses [permits more than 2 procs per job]
add hpc solvers support to all ansys users [3-4 nodes]

is os on different nodes (exec, master, license etc) decided?
does bom include rack cost?
does bom include nw equipment inside rack cost?
http://www.bioteam.net/resources/presentations/SGE-Workshop-2007/SGE_and_LSF.pdf

Platform LSF Version 6.1 - Using the Platform LSF SDK - Welcome

Platform LSF Version 6.1 - Using the Platform LSF SDK - Welcome

choosing: SGE vs LSF vs Torque

choosing: SGE vs LSF vs Torque

Re: [SAGE] [lopsa-discuss] LSF vs GRID engine

Re: [SAGE] [lopsa-discuss] LSF vs GRID engine

Sunday, March 22, 2009

problems

sleep on stomach
saliva discharge in sleep
eczema, skin rashes easily (check for blood platlet count, read http://en.wikipedia.org/wiki/Hodgkin's_lymphoma)

beer

21st March 2009 7:00 PM: had 500ml carlsberg, no problem that i remember ,
21st march 2009 10:00 PM: had rice, mushrooms, and dal 1 hour before drinking, had two 330ml tiger, got slightly stuffy nose within 1 hour of drinking, but it was fine by 2:00 AM
23rd march 2009 12:30 AM: had maggi 1 hour before drinking, had carslbery 330 ml on 22nd March 2009 1:30 AM , completely stuffed nose within 3 hours of drinking, severe water discharge from left eye and left nostril, enlarged sinus, , had difficult sleep, itchy eyes whole day, no elevated problem of skin rashes

jaw pain:

tiger (not always)
http://www.steadyhealth.com/Pain_cramp_in_jaw_bone___especially_when_I_drink_alcohol_spirits_t102397_st20.html (in rare cases), according to post above, jaw pain after consuming alcohal is linked to hodgin's disease
http://en.wikipedia.org/wiki/Hodgkin

stuufed nose: carlsberg

no problem: corona,