1 <!DOCTYPE HTML PUBLIC
"-//IETF//DTD HTML//EN">
4 <link rel=
"stylesheet" href=
"../mrtg-td/slides.css" type=
"text/css">
5 <meta http-equiv=
"Content-Type" content=
"text/html; charset=iso-8859-1">
6 <meta name=
"Language" content=
"en">
7 <meta name=
"Author" content=
"Petter Reinholdtsen">
11 <h1>Writing enterprise ready software
</h1>
15 <p><tt><a href=
"http://www.hungry.com/~pere/mypapers/enterprise-software/enterprise-software.html">http://www.hungry.com/~pere/mypapers/enterprise-software/enterprise-software.html
</a></tt></p>
17 <div class=
"presenter">Petter Reinholdtsen
19 <br>Debconf5, Helsinki
2005-
06-
12</div>
25 <li>clues for the clueless
26 <li>multilevel configuration
30 <h2>We are in trouble
</h2>
32 <li>Some thinks are possible for
1 to
10 machines, and impossible with
33 500 machines. What do you do when you break the ssh configuration
36 <h2>Trouble moving
</h2>
38 <p>With
60000 users and about
150 home directory file servers
39 available from
12000 machines, users move from file server to file
40 server. This break several applications when the path to the users
41 home directory changes. (example: /mn/hegel/u1/pere to
42 /usit/saruman/u1/pere).
</p>
45 Moving a user from one user disk to another break mozilla and
46 openoffice configuration. - not using relative paths
49 <h2>Downgrade trouble
</h2>
51 <p>With
900 linux machines with common user database and home
52 directories while running different versions of programs, users will
53 run several versions of a program with the same configuration
56 <!-- Trying to run with KDE 2 and 3 on different machines messes up
57 the configuration - not handling downgrades
59 - handle up/downgrades - people will use different versions on
60 different machines, with common home directories
64 <h2>Some old trouble
</h2>
67 <p>Some users lost the source of their production systems, and need
68 the binaries to keep working for
10-
15 years.
</p>
70 <!-- 15 year old binaries should keep working. The source is lost
71 long time ago, and the program is still used in production. -->
73 <h2>No room for more trouble
</h2>
75 <p>When the file system for the PostgreSQL database in production goes
76 full, one do not want to kick out
30000 users to take down the
77 database and resize the file system.
</p>
81 <p>RAID is only only useful until the last redundancy disk is lost.
83 <p>Automatic RAID status systems need API or command line tools to
84 extract the status. Not like afacli, which go into interactive mode
85 when an error is detected.
87 <h2>Installation trouble
</h2>
89 <p>Trying to compile/install software on Irix, Solaris, Linux, HP-UX,
90 Tru64 Unix, MacOSX and AIX when the process require a sysadmin to sit
91 around to answer questions, change CDs, or insert licenses is both
92 painful and prone to errors.
</p>
94 <h2>Network trouble
</h2>
96 <p>Trying to get some network server to work when it require to use
97 some given port range, which is already taken by some other service
98 and blocked in the router -- or try to get the coorporate network
99 gatekeeper to open up the firewall
</p>
101 <h2>Version trouble
</h2>
103 <p>Given three tcl or php applications, is there one version of TCL or
104 PHP usable with all of these?
106 <h2>Usability trouble
</h2>
108 <p>When starting a program from the meny, where does it go if nothing
109 appear on the screen?
111 <p>Do users always read their ~/.xsession-error file?
114 <h2>Clues for the clueless
</h2>
118 <li>at least three levels of config files; package defaults, site
119 defaults and host defaults
121 <li>never ask questions at compile time. when compiling automatically
122 for
10 platforms, a sysadmin do not want to sit down and answer
125 <li>split installation tasks in two, one for installation, and one for
126 the operations needing root access. (build / configuration as well)
128 <li>make sure the software can be installed anywhere (location
129 independent), avoid hardcoding paths into the binaries.
131 <li>make the source available to make it possible to fix problems on
132 site, and to use it on different platforms (os/hw) in the future
134 <li>Make paths into users home directories relative to ~user/, as users
135 will move from disk to disk, or copy their home directory from site
136 to site. Always convert paths when saving config files.
138 <li>depend on as few libraries as possible, as it is a pain to get every
139 extra library in place
141 <li>use well known libraries instead of making your own implementation.
142 reduses the security risk.
144 <li>make sure libraries, and programming languages are backwards
147 <li>use a well known license. it is a pain to evaluate every new
150 <li>when distributing source, do not use vendor specific compiler
151 features. It will not work with the other vendors compilers used to
154 <li>write portable code, make sure it works the same on all platforms.
156 <li>make the software work out of the box (require as little
157 configuration as possible).
159 <li>avoid resource leaks (memory, shared memory, locks, file
160 descriptors, X server resources, etc). Restarting a long-running
161 server is not always an option.
163 <li>system services should send messages to syslog. always log why when
164 crashing. always log problems and errors.
166 <li>reuse configuration. ktouch have its own x layout setting.
167 better to fetch the current one from X like xkeycaps.
169 <li>providing hooks to the local admins
171 <li>reduce flexibility. trying to support people over the phone when
172 the gui is different for every person is a pain.
176 <h2>Solving the upgrade problem using multilevel
181 <li>local configuration should be kept during upgrades
</li>
183 <li>easiest to do if the local configration is separate from the
186 <li>several actors what to have a say in the service
187 configuration. allow them to have their own files
189 <li>Example: read config from /usr/share/foo/config,
190 /site/share/foo/config, /etc/foo/config, ~/.foo/config,
191 /etc/foo/config.fixed, /site/share/foo/config.fixed,
192 /usr/share/foo/config.fixed.
194 <li>make it possible to provide package, site, host and user
195 defaults, as well as locking down features on a host, site and
198 <li>always well known where the admin made his changes
204 <h2>Thank you very much
</h2>