From: Eric Youngdale (eric@tantalus.dell.com)
Date: 08/03/92


From: eric@tantalus.dell.com (Eric Youngdale)
Subject: CDROM filesystem for Linux ready for beta-testing.
Date: 3 Aug 1992 13:56:29 GMT


        The subject line says it all. The file is available from banjo in
pub/Linux/Incoming in cdrom.tar.Z. I am including the readme file which
should explain most everything (related to the CDROM filesystem, that is).

        I want to give credit to David Giller who worked on the SCSI end of
things (I worked on the filesystem).

        I would be very interested in hearing from people who are using this,
even if everything works just fine. This is the only way that I have of
knowing how much of a workout this code is getting.

-Eric

**** README.CDROM

                ANNOUNCING - CDROM support for linux (beta 0.1).

        CDROM support for linux is now ready for beta testing. You
must have a CDROM drive, a SCSI adapter and a ISO9660 format disc
before this will be of any use to you. You will also need to have the
source tree for linux 0.97 kernel sources available.

        This project was a team effort. The SCSI work was done by
David Giller rafetmad@cheshire.oxy.edu, and the filesystem was written
by Eric Youngdale eric@tantalus.nrl.navy.mil. So far, the code has been
tested with an aha1542 SCSI card and both NEC and Sony CDROM drives.
A number of different discs have been tested.

        To install, unpack the archive in your linux kernel directory
(usually /usr/src. This will add a number of new files to the linux
source tree). You will then need to apply the patches found in cdrom.diff
with the following command:

patch -p0 < cdrom.diff

and then build the kernel. Once you have booted the system, you will need
to add a device with major=11, minor=0 for the first cdrom drive, minor=1 for
the second and so forth. You can use a command something like:

        mknod -m 500 /dev/cdrom b 11 0

To mount a disc, use a command something like:

        mount -t iso9660 /dev/cdrom /mnt

I would be interested in hearing about any successes or failures with this
code.

KNOWN PROBLEMS:

        Error detection and handling really sucks rotten eggs in the
0.97 release. There were a couple of new features added to the kernel
in 0.97, and these inadvertently broke some things. There are two
errors are commonly generated with the CDROM drive, one occurs when
you attempt to use the drive without a disc in it, and the second
occurs the first time you use the drive after you have changed the
disc. The precise behavior that you will observe depends a lot upon
which SCSI adapter you are using. The adaptek driver will report
"aha1542_queuecommand: Two concurrent queuecommand?". In general this
is bad, but sometimes things will work themselves out. You may find
your VC's getting locked up when this happens. I expect that
kernel patches to 0.97 will fix a lot of the bugs in the error
handling (plus add support for multiple queued commands). The error
detection and handling for CDROMs worked quite nicely in the 0.96cpl2
kernel, so for the time being I am assuming that the problem lies with
the SCSI code.

        If you have a disc in the drive when you power up the system,
and you do not attempt to remove it while the system is up, then I
think that you will not have any trouble with the CDROM error
correction/detection.

             ********************************************

Some general comments are in order:

        On some drives, there is a feature where the drive can be
locked under software control to essentially deactivate the eject
button. The iso9660 filesystem activates this feature on drives so
equipt, so you may be unable to remove the disc while it is mounted.
The eject button will be re-enabled once the disc is dismounted.

        Since it is impossible to corrupt a CDROM, it is unlikely that
a bug in the iso9660 filesystem will lead to data corruption on your
hard disk, with the possible exception of files copied from the CDROM
to the hard disk. Nonetheless, it is a good idea to have a backup or
your hard disk, just in case. Then again, I really did not need to
say that, did I :-)

        There was a bug in error handling in the aha1542 code.
Previously when a command failed, the higher level drivers would not
receive the correct sense data from the failed command. This has been
fixed.

        Up until now, SCSI devices were either discs or tapes (and the
tapes have not been fully implemented). CDROM drives are now a third
category. There are several reasons why we do not want to treat then
the same as a regular hard disk, and it was cleaner to make a third
type of device. One reason was that.....

        The CDROM has a sector size of 2048 bytes, but the buffer
cache has buffer sizes of 1024 bytes. The SCSI high level driver for
the cdrom must perform buffering of all of the I/O in order to satisfy
the request. At some point in the near future support will be present
in the kernel for buffers in the buffer cache which are != 1024 bytes,
at which time this code will be remove.

        The find command does not work without the -noleaf switch.
The reason for this is that the number of links for each directory file
is not easily obtainable, so it is set to 2. The default behavior for
the find program is to look for (i_links-2) subdirectories in each
directory, and it then assumes that the rest are regular files. The
-noleaf switch disables this optimization.

        Only the ISO 9660 filesystem is supported. High Sierra is not
supported. The High Sierra format is just an earlier version of
ISO9660, but there are minor differences between the two. Sometimes
people use the two names interchangably, but nearly all newer discs
are the ISO9660 format. It would not be that difficult to support HS,
but I doubt that there are very many HS discs are out there. I will
add this if there is demand for it.

        The Rock Ridge extensions are not yet in the filesystem.
These extensions will allow for full length unix style pathnames,
deeper directories, uid/gid, file modes, etc. Once I get my hands on
a disc that uses these extensions, I will add support to the
filesystem. This standard is fairly new, and I do not have a sense of
whether this is going to catch on or not.

        The filesystem currently has the execute permission set for
any non-directory file that does not have a period in its name. This
is a crude assumption for now, but it kind of works. There is not an
easy way of telling whether a file should be executable or not.
Theoretically it is possible to read the file itself and check for a
magic number, but this would considerably degrade performance.

        The filesystem does not support block or character devices,
fifos, or symbolic links. Also, the setuid bit is never set for any
program. The main reason for this is that there is no information in
the directory entry itself which could be used to indicate these
special types of files. The Rock Ridge extensions were designed to
attempt to correct some of these deficiencies in the ISO9660 standard.

        Filenames under ISO9660 are normally all upper case on the
disc but the filesystem maps these to all lower case. The filenames
on the disc also have a version number (like VMS) which appears at the
end of the filename, and is separated from the rest of the filename by
a ';' character. The filesystem strips the version numbers from the
filename if the version number is 1, and replaces the ';' by a '.' if
the version number is >1.

        The inode numbers for files are essentially just the byte
offset of the beginning of the directory record from the start of the
disc. A disc can only hold about 660 MB, so the inode numbers will
be somewhere between about 60K and 660M. Any tool that performs
a stat() on the CDROM obviously needs to be recompiled if it was
compiled before 32 bit inode support was in the kernel.

        Text files on a CDROM can have several types of line
terminators. Lines can be terminated by LF, CRLF, or a CR. The
filesystem scans the first 1024 bytes of the file, searching for out
of band characters (i.e. > 0x80 or some control characters), and if it
finds these it assumes the the file is a binary format. If there are
no out of band characters the filesystem will assume that the file is
a text file (keeping track of whether the lines are terminated by a
CR, CRLF, or LF), and automatically converts the line terminators to a
LF, which is the unix standard. In the case of CRLF termination, the
CR is converted to a ' '.

        A number of ioctl functions have been provided, some of which
are only of use when trying to play an audio disc. An attempt has
been made to make the ioctls compatible with the ioctls on a Sun, but
we have been unable to get any of the audio functions to work. My
NEC drive and David's Sony reject all of these commands, and we currently
believe that both of these drives implement the audio functions using
vendor-specific command codes rather than the universal ones specified
in the SCSI-II specifications.

        The filesystem has been tested under a number of conditions,
and has proved to be quite reliable so far. This filesystem is
considerably simpler than a read/write filesystem (Files are
contiguous, so no file allocation tables need to be maintained, there
is no free space map, and we do not need to know how to rename, create
or delete files).

eric@tantalus.nrl.navy.mil