Modern Distributed Filesystems For Linux: An Introduction
Administrative Issues in Distributed Filesystems

Bill von Hagen
Wednesday, August 7, 2002 11:42:51 AM
Using a distributed filesystem introduces new commands and new
concerns for system administrators, but also simplifies many standard
administrative tasks. Distributed computing environments typically
enable users to log in on any workstation within an administrative
domain. This requires that the login, or authentication, mechanism,
also be network-aware. In distributed filesystem environments,
password and group files located on individual machines must be
secondary to networked authentication mechanisms. A network-aware
authentication mechanism, such as Kerberos or NIS, provides users the
flexibiltiy to use any workstation, while standard machine-specific
authentication mechanisms must still exist so that administartors can
log in on individual machines to repair them or perform administrative
tasks.
Storing shared data on centralized file servers rather than on
individual desktop systems simplifies administrative tasks such as
backing up and restoring files and directories. It also centralizes
standard storage administration tasks such as monitoring filesystem
use, and introduces new possibilities for storage management, such as
load balancing. Distributed filesystems such as OpenAFS and
Coda provide built-in logical volume management systems that enable
administrators to move heavily-used volumes to more powerful or
lightly-used machines. If the distributed filesystem supports
replication, copies of heavily-used volumes can be distributed across
multiple fileservers for use by different clients. This can reduce
network use and lighten the load on specific servers. By using logical
volumes rather than disk-specific physical volumes, distributed
filesystems can also make it easy to add storage to your computing
environment while your systems are running, without requiring
downtime.
Using a distributed filesystem also makes it easier to share access to
software, though you have to make sure that your software licenses
enable you to install software into a distributed filesystem. Like the
print servers that were part of the original motivation for
client/server computing, distributed filesystems also simplify sharing
access to specialized hardware by connecting to the system that hosts
the hardware over the network and still being able to see all your
files and data.
Using a centralized distributed filesystem can provide significant
cost and performance benefits for client systems. Distributed
filesystems substantially reduce hardware costs by minimizing the type
and amount of storage that is required on any desktop or
laptop workstation. Using a distributed filesystem as the repository
for user data usually means faster client restart times because much
of the data is no longer stored locally and therefore does not have to
be checked for filesystem consistency after restarting a client. The
combination of a distributed filesystem using a journaling filesystem
for all or most of the filesystems local to client systems can provide
additional improvements in system restart times.
Next: Support for Disconnected Operation »