Paul Anderson
Christopher L. Barnard
Ann Benninger
David Blank-Edelman
Strata Rose Chalup
Brent Chapman
Albert Chin-A-Young
Richard Chycoski
Lee Damon
Tina Darmohray
Xev Gittler
Geoff Halprin
Helen Harrison
Christine Hogan
Brian Kantor
Tom Limoncelli
David Lindes
Bryan McDonald
Mark Mellis
Hal Miller
Ruth Milner
Paul Moriarty
Cat Okita
David Parter
Giray Pultar
Ian Reddy
Shawn Rutledge
Phil Scarr
John Sellens
Dimitri Shcherban
Josh Simon
David Williamson
Pat Wilson
9:00 - 10:30 | Introductions and Statements of Problems | |
11:00 - 12:30 | Discussion of Selected Problem(s) | |
13:30 - 15:00 |
Presentations of Recent or Current Work
Group Surveys |
|
15:30 - 17:00 |
Presentations and Surveys (cont'd)
Hot Technologies: Rumors and Refutations |
Atendees were reminded that they were there to share their own experience and knowledge, and not as representatives of their company (insofar as the goal of the workshop was not to sell either companies or products, but rather, to share ideas and problems with our peers). Since many of the issues and problems that were likely to be discussed were difficult simple because of the (sometimes staggering) scale of the problem (e.g., backing up 10 or even 100 gigabytes is hard -- backing up tens of terabytes is nearly impossible!), attendees were asked not to play ``mine's bigger than yours.'' However, when Lee Damon (of Qualcomm) introduced himself as ``working for the people that make these'' (and holding up a Qualcomm Q800 -- a very small cellular telephone), Brent Chapman held up his Morotola StarTAC (another very small cellular telephone) and said ``mine's smaller!'' Lee then noted that it's only when discussing ``cell phones, cameras, and pagers will you hear a man say `mine's smaller than yours' with pride.'' :-)
The attendees, their environment, and their problems are presented in alphabetical order.
Name: | Paul Anderson | |
---|---|---|
Affiliation: | University of Edinburgh | |
Scale: | 1,000- machines | |
Comments: | ||
» Formerly mostly-Sun shop, now diverging to NT and Linux (PDAs, too) | ||
Problems/Issues: | ||
» Providing core infrastructure for user self-support | ||
» NT Integration | ||
» User authentication and security | ||
Name: | Christopher L. Barnard | |
Affiliation: | Chicago Board of Trade | |
Scale: | 530 workstations | |
Problems/Issues: | ||
» Remote administration without remote admins | ||
» 99.98% uptime [1.75 hours downtime per year] | ||
Name: | Ann Benninger | |
Affiliation: | Exelixis Pharmaceuticals | |
Scale: | 100 people, 150 machines, 500 GB | |
Problems/Issues: | ||
» Explosive data growth | ||
» 7x24 expectations without appropriate budgets | ||
Name: | David Blank-Edelman | |
Affiliation: | Northeastern University | |
Problems/Issues: | ||
» Increased security requirements | ||
» Growing demands on sysadmin staff by ever-maturing user population | ||
» Work satisfaction for sysadmins | ||
Name: | Strata Rose Chalup | |
Problems/Issues: | ||
» 200K users messaging system for roaming palm computing | ||
» Scalability of internet services for clients | ||
» Distributed database management (accounts, auth) | ||
» `Fighting the conduit war' [delivering services, not technologies] | ||
» Showing that sysadmins affect the bottom line (and thus are professionals) | ||
Name: | Brent Chapman | |
Affiliation: | Covad | |
Comments: | ||
» Covad is a DSL startup | ||
Problems/Issues: | ||
» Growth -- doubling every 4 months (340 now) [moving frequently] | ||
» VP's who think think understand IT | ||
» 7x24 uptime w/limited staff | ||
Name: | Albert Chin-A-Young | |
Affiliation: | The Written Word/Abbott Labs | |
Scale: | 350 machines, 1,000 users | |
Problems/Issues: | ||
» Packaging of software for distribution | ||
» Managing and exploiting testing environments for multiple platforms | ||
» Single signon [authentication] | ||
» Global filesystem design and implementation [file sharing, UNIX/NT] | ||
» NT/Unix integration; account name management, authentication | ||
Name: | Richard Chycoski | |
Affiliation: | Cisco | |
Problems/Issues: | ||
» User accounting admin on multiple platforms | ||
» Downtime in 7x24 environment | ||
Name: | Lee Damon | |
Affiliation: | Qualcomm | |
Scale: | 10,000 users, 10,000 PCs, 2,000 Macs, 3,000 Unix | |
Problems/Issues: | ||
» Y2K | ||
» Politics | ||
» Merging UNIX & NT environments | ||
Name: | Xev Gittler | |
Affiliation: | Goldman, Sachs | |
Scale: | 10,000+ users | |
Problems/Issues: | ||
» Integration UNIX/NT/Mainframe/Pilot [no macs] | ||
» Applications/developer management (1,000 developers) | ||
» Large scaling issues (including remote offices w/small staff) | ||
Name: | Geoff Halprin | |
Affiliation: | Consulting w/Telstra and IBM | |
Problems/Issues: | ||
» Software currency in large (650+) mission-critical environments | ||
» Standardization of sysadmin practices | ||
Name: | Helen Harrison | |
Affiliation: | SAS Institute | |
Problems/Issues: | ||
» Unix-NT Integration [accounts, filesharing, global sysadmin] | ||
Name: | Christine Hogan | |
Affiliation: | Global Networking and Computing, Inc. (GNAC) | |
Problems/Issues: | ||
» Consulting/Data center outsourcing | ||
» Cost-cutting (sysadmin is cost-center, not profit center) | ||
» Growing demands on sysadmin staff by ever-maturing user population | ||
Name: | Brian Kantor | |
Affiliation: | UCSD | |
Scale: | 25,000 hosts, 40,000 users, high turnover | |
Comments: | ||
» [Most problems are political for Brian, who lives downhill from the outhouse] [Kolstad] | ||
Problems/Issues: | ||
» Finding competent people to work for university salaries | ||
Name: | Tom Limoncelli | |
Affiliation: | Bell Labs (a division of Lucent Technologies) | |
Problems/Issues: | ||
» Single signon | ||
» Cramped machine room space resources | ||
» Lack of theoretical basis for sysadmin | ||
Name: | David Lindes | |
Affiliation: | Netscape Communications | |
Problems/Issues: | ||
» Employee turnover (loss of project memory) | ||
» Maintaining namespaces across worldwide org | ||
» Package management on multiple platforms | ||
Name: | Bryan McDonald | |
Affiliation: | Global Networking and Computing, Inc.(GNAC) | |
Scale: | 10-10,000 users @ clients | |
Problems/Issues: | ||
» Security in face of declining budgets | ||
» Security while outsourcing | ||
» Security with new un-secure applications | ||
» Security in environment of non-technical management | ||
Name: | Hal Miller | |
Affiliation: | University of Washington | |
Scale: |
200 machines, 1,000 next year, 1PB in four years(!)
1TB, doubling every 5 months |
|
Problems/Issues: | ||
» Storage management (including backup) | ||
» Budget allocation/chargeback schemes | ||
» Account management systems (must work w/UNIX, NT, and Macs) | ||
» Building services for space (moving people/networks around) | ||
Name: | Ruth Milner | |
Affiliation: | National Radio Astronomy Observatory | |
Scale: | 500 users, 600 PC/UNIX | |
Problems/Issues: | ||
» Y2K testing of 3rd party s/w (250 apps) | ||
» Single signon/single user account [NT/Unix] | ||
» Consensus on technical decisions | ||
Name: | Paul Moriarty | |
Affiliation: | U S Web/CKS | |
Problems/Issues: | ||
» How does one make sysadmins better project managers? | ||
Name: | Adam Moskowitz | |
Affiliation: | Genome Therapeutics | |
Scale: | 250 users, 300 machines, 1TB store | |
Problems/Issues: | ||
» Explosive data growth | ||
Name: | Cat Okita | |
Affiliation: | Earthworks; Toronto Stock Exchange | |
Scale: | 500 users, 500 machines, varied brokers | |
Problems/Issues: | ||
» Soft skills [writing/presenting, managing, selling ideas/ourselves] | ||
» Marketing/training/certification -- gaining recognition for profession | ||
» Planning | ||
Name: | David Parter | |
Affiliation: | University of Wisconsin, Computer Sciences Department | |
Scale: | 6,000 users | |
Problems/Issues: | ||
» Communication skills [w/staff and w/users] | ||
» Users expectations too high | ||
» `Doing things right' (avoiding ad hoc solutions) | ||
Name: | Giray Pultar | |
Affiliation: | John Hancock Funds | |
Scale: | 2 hosts, 12 users | |
Problems/Issues: | ||
» Data growth | ||
» Euro currency conversion | ||
» Linux in workplace (users bringing in versions, flaky h/w, mgmt buy-in) | ||
Name: | Ian Reddy | |
Affiliation: | Cisco | |
Scale: | 15,000 users (half engr) | |
Problems/Issues: | ||
» Maintaining consistency in sysadmin practices as orgs grow | ||
» Growing into management | ||
» Beyond Y2K | ||
Name: | Shawn Rutledge | |
Affiliation: | Sequent Computer Systems (Consulting Division -- @ Boeing) | |
Problems/Issues: | ||
» Scaling vendor apps to large environments | ||
» Politics of consulting to huge company | ||
» Scaling NT to large environments | ||
» ``Can't do too much to solve problems, because it's NT'' | ||
Name: | Phil Scarr | |
Affiliation: | Global Network and Computing, Inc. (GNAC) | |
Problems/Issues: | ||
» Managing (people/projects) and its transition from technical positions | ||
» Working in NT-only environments at customer sites who have NO tech staff | ||
» Managing politics at numerous client sites | ||
Name: | John Sellens | |
Affiliation: | UUNet Canada (Engineering) | |
Problems/Issues: | ||
» Large storage | ||
» Upgrades without interrupting services | ||
» 7x24 Unmanned remote servers | ||
Name: | Dimitri Shcherban | |
Affiliation: | Contractor at NASA w/users also in Japan | |
Scale: | ~60 users | |
Problems/Issues: | ||
» Storage | ||
» UNIX/NT integration (by elimination of UNIX) | ||
» Support of tropical rainforest mission | ||
Name: | Josh Simon | |
Affiliation: | Consultant, Collective Technology, on-site at Motorola | |
Scale: | team supports thousands of users | |
Problems/Issues: | ||
» Time for projects (vs. process for operations) | ||
» Writing skills for technical and management operations | ||
» Upgrading hw/sw in 7x24 shop | ||
Name: | David Williamson | |
Affiliation: | Global Networking and Computing, Inc. (GNAC) | |
Problems/Issues: | ||
» Selling cost of VPN to management | ||
» Juggling n business clients with n x m passwords | ||
» Scaling | ||
» Dealing with utilities (like telcos) | ||
Name: | Pat Wilson | |
Affiliation: | Dartmouth College | |
Scale: | 3,000 users, 35 unix servers | |
Problems/Issues: | ||
» Retrofitting infrastructure (central backup, authentication, policies) | ||
» Security in academic environment |
An attempt was made to define the problem, but this proved rather more difficult than expected. What follows is Rob Kolstad's herculean attempt to summarize the discussion:
Problem statement(s):
It is possible to construct any of a number of paradigms for system administration at a given site
It is desirable to reduce the number of various paradigms and thus reduce complexity and thus reduce costs
It is desirable to have shared goals among the sysadmin group and global organization
Standard procedures are challenging to create
Standard procedures are difficult to enforce
Motivation for decisions is often omitted from documentation
Turnover and individualism thwart standardization
Intergroup politics can thwart more general standardization
How are goals set for sysadmins?
Are procedures documented so that results are repeatable?
Desirable to create an environment where pre-defined responses exist for all conceivable situations (including routine situations and rare ones)
Management wants to be able to solve problems as they occur, with interchangeable people and standard procedures
Larger organizations demand control mechanisms that differ from the `benign dictatorship' that often works well in smaller shops
Professionalism demand that sysadmins address problems in a standard way that users/etc. can expect
Sysadmin paradigms and tools change very quickly
Sysadmins (or their management) require authority to implement certain paradigm mandates
Big iron data centers might provide a model for solving this problem
Predictability reduces chaos and increases RAS
Geoff Halprin is developing sysadmin taxonomy (!); maybe a best practices document can define a consistent attack
Comments on the problem:
Setting standards is ongoing at several participants' organizations
Current problems similar to those cited by mainframe people when Unix came out
Technicians are not architects and architects are rare
Different people have different sets of strengths
Companies need to exert ownership over their computers -- they are not ``PERSONAL'' computers
Management perceptions of budgetary issues surrounding support organizations is complex, chaotic, and potentially bizarre
Responsibility and authority really must go hand-in-hand
What follows are the highlights of this "discussion"; no attempt was made to record who "presented" what task/tool/idea.
I have 15 years of e-mail and I've extracted the addresses. So, I built a database to use as an incoming spam filter that dumps suspected spam into a special folder.
I wish there was a metric that enabled me to choose ISPs from whom to purchase effective bandwidth.
Of the 35 attendees:
Structure of internet is important to politicians because diversity of suppliers can enhance reliability.
I'd also like to thank Rob Kolstad for his work as scribe and co-chair. His efforts allowed me to concentrate on interacting with the attendees without worrying that what was being said would be lost; he also provided valuable assistance that helped keep the workshop running smoothly. It is also thanks to Rob efforts that this summary exists at all; the only thing I did was sprinkle in some HTML.
Finally, I'd like to thank Judy DesHarnais, the rest of the USENIX conference staff, the USENIX office staff, the USENIX and SAGE boards, and all the rest of the volunteer that helped organize and run the conference. Without them, there would have been no conference.
Adam S. Moskowitz, Workshop chair