I/O

Goals

Learn about file systems and path representations.
Represent paths using Java interfaces.
Access files.
Work with byte-oriented input and output streams.

Concepts

absolute path
buffer
directory
Faux Pas
file
file system
flush
input/output (I/O)
input stream
mark
output stream
parent directory
path
relative path
reset
root directory
try-with-resources

Language

try (with resources)

Library

Lesson

The Evolution of Java I/O

The JDK has been released with various input/output (I/O) libraries over the years. Some aspects of newer libraries replaced older ones. Other aspects continue to coexist with old classes. Here is a quick overview of the evolution of Java I/O over the years, just to get a feel for what has come before and to recognize some of the terminology when you see it.

IO - The original classes in the java.io package concentrated on traditional I/O streams and file random-access classes. Many of these classes can still be used, although newer libraries bring alternatives for special applications. The java.io.IOException exception class still pervades I/O code, but the the java.io.File class which was central to this package should not be abandoned for new code.
NIO - Java introduced the java.nio package (new I/O), which included new whiz-bang concepts such as java.nio.Buffer and channels. These will be discussed in upcoming lessons.
NIO.2 - Java added some new approaches to asynchronous I/O, but most significantly across the board was the introduction of the java.nio.file.Path interface (to replace java.io.File), along with many other classes in the java.nio.file package. Note that Java did not create a new java.nio2 package for these additions; instead the new classes are scattered across packages.

Exceptions

java.io.IOException: A checked exception traditionally used as a general indication of I/O error.
java.io.UncheckedIOException: An unchecked exception representing an I/O error. This class was recently added. Especially useful for code with lambda expressions, as many functional interfaces do not allow for checked exceptions.

UncheckedIOException can wrap an IOException, so that you can use the following pattern if you access I/O code in a lambda expression, for example:

try {
  //TODO call methods that can throw IOException
} catch(final IOException ioException) {
  throw new UncheckedIOException(ioException);
}

Just because you've wrapped the IOException doesn't mean the problem is solved; you should probably catch the UncheckedIOException up the calling chain and unwrap it by throwing UncheckedException.getCause() so that code even higher in the stack expecting an IOException will function correctly.

When used with Java streams, the overhead of wrapping and unwrapping IOException using UncheckedIOException can bloat your source code so much that it becomes confusing and even unreadable. You may need resort to a dangerous technique: “sneaky throw” the exception using a library such as Faux Pas to wrap lambda expressions within a stream. If you choose this option, you must make sure you catch the exception, as the compiler will not detect an oversight. This technique was discussed in depth in the lesson on streams.

File Systems

Computers store persisted information in files on some file system. Different file systems have different aspects such as security, attributes, and case sensitivity. Examples include NTFS (primarily Windows) and ext4 (primarily Linux).

Windows maintains a potentially different file system for each drive, each identified by a letter and a colon such as C:. Linux creates a virtual file system on top of all mounted drives; this virtual file system has a root of / for the entire system.

Java represents information about a file system using the java.nio.FileSystem class. To get the default file system use the helper class java.nio.FileSystems method FileSystems.getDefault().

Java addresses file in a case significant manner; filenames with uppercase and lowercase letters are distinct. The NTSF file system used by the Windows operating system is also case significant, but Windows itself sees files without case sensitivity! (See Filenames are Case Sensitive on NTFS Volumes.) That is, Java would see the files foobar.txt and FOOBAR.TXT as separate files, but Windows would think that both names refer to the same file! Put another way, if Java asks for the file foobar.txt, Windows would return the contents of a FooBar.TXT file if it existed; this would not happen on Linux! Java sees files as case-sensitive, but Windows does not.

The files on a file system are usually divided into plain files, and directories, which are used to hierarchically groups. A file (or directory) is identified by its path. A path can be an absolute path if it indicates the complete path (from the outermost or root directory) necessary to locate a file; or a relative path if it only indicates the portion of the path necessary to locate the file from some other directory.

Paths are separated into parts by a separator character; on Linux and related systems this is the forward slash / character; on Windows it is the backslash \ character. An earlier part in the path indicates the parent directory of the directory or file later in the path.The special directory names . and .. refer to the current directory and parent directory, respectively.

The FileSystem class has a getSeparator() method to provide you the correct path separator for directories in a path, so you could use FileSystems.getDefault().getSeparator(). Although much legacy code will use the java.io.File.separator and java.io.File.separatorChar constants to retrieve the same value, it's best not to use the java.io.File class unless you are interfacing with legacy code. Especially be careful not to mistakenly use the java.io.File.pathSeparator and java.io.File.pathSeparatorChar constants, which provide the character used between paths to separate one path from another, not to separate components inside a single path. Most of the time you should not be building paths manually, anyway, as this is tedious and error-prone. Build paths using the correct classes and helper methods, explained below.

The Windows file system may have several root directories such as C:\ and D:\, while Linux file systems have a single root directory /. You can list all the available root directories using FileSystem.getRootDirectories(). See Path below for information on how to processed the returned values.

File path examples.

Path	Relative/Absolute	Description
`.`	relative	Current directory.
`..`	relative	Parent directory.
`foobar.txt`	relative
`./foobar.txt`	relative	Same file as above.
`foo/example.txt`	relative
`../bar/example.txt`	relative
`C:\foo\bar.txt`	absolute	File system on Windows OS.
`/etc/foo/bar.txt`	absolute	File system on Linux OS.

Files that start with . such as .foo are typically used as configuration files and directories. Directories and files with a beginning . are hidden by default on Linux, and have become somewhat customary for this purpose across operating systems.

You can find the current user's home directory by using System.getProperty("user.home"). You should convert the returned string to the appropriate path object, as explained in the Path section below. This approach for getting the user's home directory was unreliable on version before Java 8. See Bug JDK-6519127.

`Path`

Java provides a versatile interface for identifying files and directories: the java.nio.file.Path class. You can get a Path instance by asking the FileSystem for it using FileSystem.getPath(…). Rather than calling FileSystems.getDefault().getPath(…), you can use the java.nio.file.Paths utility class using the method Paths.get(…).

Various types of paths using the Path class.

//relative paths
final Path path1 = Paths.get("bar.txt");
final Path path2 = Paths.get("foo" + FileSystems.getDefault().getSeparator() + "bar.txt"); //manual construction
final Path path3 = Paths.get("foo", "bar.txt"); //same as path2 but using preferred approach
//files
final Path windowsExample1 = Paths.get("C:\\foo\\bar.txt");
final Path windowsExample2 = Paths.get("C:", "foo", "bar.txt"); //same as windowsExample1
final Path linuxExample = Paths.get("/etc/foo/bar.txt");
//directory
final Path linuxDirectory = Paths.get("/etc/foo/");

Useful Path methods, using directory /foo/bar/ as an example.

`Path` Method	Description	`Paths.get("/foo/bar/")`	Returns
`Path.getRoot()`	The root of the path.	`.getRoot()`	`/`
`Path.getFileName()`	The name of the file or directory	`.getFileName()`	`bar`
`Path.getNameCount()`	The number of name elements in the path.	`.getNameCount()`	`3`
`Path.isAbsolute()`	Whether the path is absolute.	`.isAbsolute()`	`true`
`Path.relativize(Path other)`	Determines the relative path from this path.	`.relativize("/foo/bar/some/example.txt")`	`some/example.txt`
`Path.resolve(Path other)`	Combines this path with a relative path.	`.resolve("some/example.txt")`	`/foo/bar/some/example.txt`
`Path.resolve(String other)`	Combines this path with a relative path.	`.resolve("some/example.txt")`	`/foo/bar/some/example.txt`

A Path only identifies a file by its path/name; it is not a replacement for the file itself—a file or directory might not even exist at that path. If you rename a file, for example, the Path instance that identified that file will still indicate the previous path after the rename operation.

Paths.get(…) is inflexible because is makes assumptions about the file system context. Throughout most of your code you should use Path.resolve(…) for creating more paths in relation to existing path instances.

The Path class even implements Iterable<Path> so that you can iterate over the name elements in the path—each of them provided as an instance of Path itself.

Files

For actually working with files on a disk, you can use the utilities in the java.nio.file.Files class. This class contains a wealth of methods, including methods for checking whether a file is readable or writable.

Files.createDirectories(Path dir, FileAttribute<?>... attrs): Creates a hierarchy of directories if they do not exist. No error is generated if one or more of the directories already exist.
Files.createDirectory(Path dir, FileAttribute<?>... attrs): Creates a single new directory. An error may be given if the directory already exists.
Files.exists(Path path, LinkOption... options): Checks to see if a file exists at the path.
Files.isDirectory(Path path, LinkOption... options): Determines whether a path represents a directory.
Files.list(Path dir): Returns a Stream<Path> listing all the paths in a directory. Using stream filtering and processing operations, you can easily return a list of only files with certain filenames for example. The stream returned by this method must be closed, or you will leak resources which could eventually crash your application.

The Stream.close() method is infrequently used and even less frequently discussed, but after listing directories, closing the stream is absolutely essential. If you don't close the Stream<Path> returned from Files.list(…), Java may leave files open on the file system! After repeated use the system will run out of low-level handles used for accessing files, preventing your application from performing further file operations. You can easily ensure a Stream is closed using the Try-with-Resources technique explained below.

Most legacy Java code accesses files using the java.io.File class, but this approach is not recommended for new code; the File class does not separate the concepts of file identification with file system access. Use the Path, FileSystem, and related classes for new code, and only resort to using File if you need to interact with legacy APIs. You can get a Path instance from a File by calling File.toPath(), or get a legacy File instance from a Path by calling Path.toFile().

Byte Streams

The most fundamental approach to processing I/O in Java relies on specialized classes that allow programs to process information a byte at a time. An input stream allows a program to read a stream of bits from a data source as bytes. An output stream allows a program to write a stream of bits to a data source, one or more bytes at a time.

The low-level input stream and output stream classes are used to process I/O bytes; they are completely unrelated to the Stream<T>-based classes you learned about in a previous lesson for processing a sequence of objects using functions.

Many byte streams allow you to wrap an existing stream using the decorator pattern. You can create a buffered version of any input stream, for example, by using new BufferedInputStream(existingInputStream).

All byte streams implement java.io.Closeable, which gives them a Closeable.close() method. You usually need to call close() after using a stream, even if an error occurred! This can be accomplished using a try … finally block.

final InputStream inputStream; //TODO get input stream somehow
try {

  //TODO process data from the input stream

} finally {
  inputStream.close(); //always close the input stream
}

The preferred way to ensure that streams are closed is to use Try-with-Resources, explained below.

Input Streams

The following input stream classes are all in the java.io package.

InputStream class diagram. — `InputStream` class diagram.

InputStream: Abstract class that forms the basis of all input streams.
BufferedInputStream: Provides buffering of other input streams.
ByteArrayInputStream: An input stream to an existing array of bytes.
FileInputStream: An input stream to a file. This class uses the old java.io.File class and should only be used with legacy code.
FilterInputStream: A simple input stream wrapper allowing subclasses to do more processing on data after reading.
DataInputStream: Provides methods to read primitive Java types in a consistent way across platforms.
ObjectInputStream: An input stream that allows deserialization of Java objects and their instance graphs.

Output Streams

The following output stream classes are all in the java.io package.

OutputStream class diagram. — `OutputStream` class diagram.

OutputStream: Abstract class that forms the basis of all output streams.
BufferedOutputStream: Provides buffering of other output streams.
ByteArrayOutputStream: An output stream to a dynamically managed internal array of bytes. The collected data can later be retrieved using ByteArrayOutputStream.toByteArray().
FileOutputStream: An output stream to a file. This class uses the old java.io.File class and should only be used with legacy code.
FilterOutputStream: A simple output stream wrapper allowing subclasses to do more processing on data before writing.
DataOutputStream: Provides methods to write primitive Java types in a consistent way across platforms.
ObjectOutputStream: An output stream that allows serialization of Java objects and their instance graphs.
PrintStream: An output stream that helps write certain data using methods such as println(). This class does not correctly encode character and strings across platforms; it should not be used unless you have no other option.

Reading Single Bytes

The abstract class java.io.InputStream forms the basis of all byte stream-based input. Its main method is InputStream.read(), which returns eight bits of information (a byte)—but the byte is returned as an int! This is because the special int value -1 is used to indicate that no further bytes are available to be read (the end of the stream has been reached). If a byte value were used, there would be no way to distinguish between a value -1 indicating the end of the stream, and the byte (which is signed) value -1 representing 0b11111111.

The following example shows how to read from an input stream consisting of an existing array of bytes using java.io.ByteArrayInputStream. The first half of the example merely creates a sequence of bytes to serve as the data to read.

Reading individual bytes from an InputStream until the end of the stream is reached.

//create an array with values 0 ... 255 (256 bytes, or 0x100)
final byte[] inputBytes = new byte[0x100];
for(int i = 0; i < inputBytes.length; i++) {
  inputBytes[i] = (byte)i;  //we know the value isn't larger than a byte (0xFF)
}

//create an input stream from the byte array
final InputStream inputStream = new ByteArrayInputStream(bytes);
//read and print each byte until we reach the end of the stream
try {
  int byteValue;
  while((byteValue = inputStream.read()) != -1) {
    System.out.println(byteValue);
  }
} finally {
  inputStream.close();
}

Be especially careful working with byte and int values. Remember that byte is signed. In the loop above, we had to use an int counter to fill the byte array, because a byte counter would have wrapped around to negative numbers after 127. But we know that integer values up to 0xFF can fit into a byte, so the cast is OK.

Try-with-Resources

You already know how to use try … finally … to ensure that you close a Closeable resource in the finally {…} clause. Java offers a further enhancement of the try statement: if a class implements java.lang.AutoCloseable (and the Closeable interface extends AutoCloseable, so all input and output streams are candidates), it can be used in a try-with-resources statement. Simply declare and assign the AutoCloseable resource in parenthesis after the try keyword. Java will automatically add, in the compiled code, the equivalent of a finally clause that calls close() on the resource, whether or not the try clause throws an exception. Here is how the above try … finally statement would be rewritten to use try-with-resources:

Reading from an InputStream using try-with-resources.

//create an array with values 0 ... 255 (256 bytes, or 0x100)
final byte[] inputBytes = new byte[0x100];
for(int i = 0; i < inputBytes.length; i++) {
  inputBytes[i] = (byte)i;  //we know the value isn't larger than a byte (0xFF)
}

//create an input stream from the byte array
try(final InputStream inputStream = new ByteArrayInputStream(bytes)) {
  int byteValue;
  while((byteValue = inputStream.read()) != -1) {
    System.out.println(byteValue);
  }
}

As an added benefit, if the close() method throws an exception after the try clause throws an exception, it is recorded but the original exception is the one thrown. Contrast this with a manual finally clause, in which any exception thrown by close() will immediately be thrown, losing the original exception that was thrown from the try clause.

You can declare more than one AutoCloseable resource if you separate them by the semicolon ; character.

Mark and Reset

There may be times you are reading from a stream and decide, oops, I wish I could unread some information, and go back to start reading at some earlier location. The InputStream class has a facility for placing a marker at location to later go back to.

At any time when reading from an input stream, you can call Inputstream.mark(int readlimit) to request the input stream to mark the current location. The readlimit value indicate the maximum number of bytes you might read before wanting to go back to the mark.
If you later call InputStream.reset() you will reset the stream to the marked location, and the next bytes read will be those directly after the marked location—even if you've already read those bytes earlier.

The mark/reset facility therefore provides a way for the input stream to somehow remember any bytes (up to the readlimit you provided) you read after the mark and somehow effectively put them back into the input stream to be read again.

If you need to support mark/reset but a given InputStream doesn't support mark/reset, find an InputStream that does, such as BufferedInputStream, and wrap the given an InputStream with that.

Writing Single Bytes

The complement to InputStream is the java.io.OutputStream. An output stream allows writing of single bytes using OutputStream.write(int b). But moving data between streams using a byte at a time is inefficient; there are much more efficient ways to move data between stream, as explained in the following sections.

As an int is much bigger than a byte, OutputStream.write(int b) only writes the low-order eight bits of the int; the other 24 bits are ignored. Using an int makes it easier to pass an unsigned eight-bit value for writing.

Reading and Writing Multiple Bytes

Many times you will want to read and writer larger sections of data by transferring it to and from a buffer, an area of memory designated for transferring the data. InputStream provides an InputStream.read(byte[] b) method that reads bytes into an existing byte array buffer. There always exist the possibility that, for whatever reason, fewer bytes (even 0!) might be read; this method therefore returns an int indicating the number of bytes read. If the method returns -1, it indicates that the end of the stream has been reached.

We can use such a buffer to copy between two streams. OutputStream provides a corresponding OutputStream.write(byte[] b), but this method assumes that the entire buffer is full and that all the bytes shoudl be written. Because the read operation may not have filled the buffer, we must take care to only write the number of bytes that were read each time around. This can be done using the OutputStream.write(byte[] b, int off, int len), which allows the starting offset (in this case 0) and a length (in this case the number of bytes read), the number of bytes to read.

In this example we copy everything from the input stream to a java.io.ByteArrayOutputStream which collects all the bytes, which we then print out, using the ByteArrayOutputStream.toByteArray() method.

Copying from an InputStream to an OutputStream using a buffer.

//create an array with values 0 ... 255 (256 bytes, or 0x100)
final byte[] inputBytes = new byte[0x100];
for(int i = 0; i < inputBytes.length; i++) {
  inputBytes[i] = (byte)i;  //we know the value isn't larger than a byte (0xFF)
}

//create a buffer array for copying up to 16 bytes at a time (an arbitrary value)
final byte[] buffer = new byte[0x10];

//create a destination output stream for the bytes
final ByteArrayOutputStream baos = new ByteArrayOutputStream();

//copy a buffer at a time until we reach the end of the input stream
try {
  try(final InputStream inputStream = new ByteArrayInputStream(inputBytes)) {
    int count;
    while((count = inputStream.read(buffer)) != -1) {  //-1 indicates end of stream
      baos.write(buffer, 0, count);
    }
  }
} finally {
  baos.close();
}

//print out the bytes in the destination stream
final byte[] outputBytes = baos.toByteArray();
for(final byte byteValue : outputBytes) {
  System.out.println(Byte.toUnsignedInt(byteValue)); //print unsigned values
}

We declare the ByteArrayOutputStream outside of the try statement because we need to access it later to get its bytes using toByteArray(). This means we can't use try-with-resources with it, which is why we have nested try statements to make sure it's closed. Technically we don't truly need to close the ByteArrayOutputStream after using it, as it is really just a wrapper to some bytes in memory, but we do so here out of good habit.

Note the use of while((count = inputStream.read(buffer)) != -1) to assign a value (because we need to use that value later) and compare the value—all within the same statement.

We use baos.write(buffer, 0, count) to write only the number of bytes that were read into the buffer. Remember that the buffer may not have been filled (maybe we were reading from a slow Internet connection); we might have read even zero bytes!

File Streams

You can get an input stream for reading from a file, or an output stream for writing to a file, by using the Files.newInputStream(Path path, OpenOption... options) or the Files.newOutputStream(Path path, OpenOption... options) method, respectively. Here's an example of printing out all the bytes in a /etc/foo/bar.txt file.

Reading from a file using an InputStream.

final static Path path = Paths.get("/etc/foo/bar.txt")
try(final InputStream inputStream = Files.newInputStream(path)) {
  int byteValue;
  while((byteValue = inputStream.read()) != -1) {
    System.out.printlnt(Byte.toUnsignedInt(byteValue));
  }
}

As already mentioned, Java provides a java.io.FileInputStream and a java.io.FileOutputStream for working with the old java.io.File class. You should only use these for legacy code.

Buffered Streams

Java provides java.io.BufferedInputStream and java.io.BufferedOutputStream for converting any input or output stream to a buffered version. These classes make working with relatively slow connections more efficient, because they will read or write blocks of data to an internal buffer in memory. You can still read and write the data a byte at a time, but you will be accessing an internal buffer which is much quicker that reading or writing data a byte at a time with e.g. a hard drive. These classes will transfer the data in blocks to the ultimate destination when needed. Because these classes use the decorator pattern, you can simply wrap an existing stream on the fly. There is no need to close the underlying stream; closing the wrapper stream will close the decorated stream as well.

Buffered reading from a file using a BufferedInputStream.

final static Path path = Paths.get("/etc/foo/bar.txt")
try(final InputStream inputStream = new BufferedInputStream(Files.newInputStream(path))) {
  int byteValue;
  while((byteValue = inputStream.read()) != -1) {
		System.out.printlnt(Byte.toUnsignedInt(byteValue));
  }
}

You should always use a BufferedInputStream or BufferedOutputStream when working directly with a relatively slow data source. But there is no point in wrapping a ByteArrayInputStream or a ByteArrayOutputStream in a buffered stream, as these streams refer to in-memory data and are essentially buffered by definition. Similarly there is no need to wrap input and output streams in a buffered stream when you are copying between them using your own buffer, as in the example above.

The flush() method available with every OutputStream is especially useful for a BufferedOutputStream if you need to clear out the buffered data and pass it down the chain immediately.

If you create a method that takes an OutputStream as a parameter and wraps it with a BufferedOutputStream inside the method, you must call BufferedOutputStream.flush() when you are done! The caller does not have access to the BufferedOutputStream created inside the method, and will have no way to ensure the buffered bytes get written, even if the caller flushes and closes the original OutputStream given as an argument.

Review

Summary

Use java.nio.file.Path to identify files and directories.
Use java.nio.file.Files to manipulate the actual file a Path refers to.

Gotchas

If you are typing a string containing a Windows path, don't forget to escape the backslash character! C:\foo\bar in Java must be entered as "C:\\foo\\bar" for Java to correctly understand the string.
The FileSystem.getPathSeparator() method is used to separate more than one path; if you want the separator for separating directory parts in the same path, use FileSystem.getSeparator().
The PrintStream methods for printing characters and strings won't correctly encode all characters.
You must close close the Stream<Path> returned from Files.list(…) (using try-with-resources if you like) or Java will leave files open on the file system and eventually be unable to perform further file operations.
The InputStream.read(byte[] b) and related method may not fill the given buffer—and in fact may not read any bytes at all, even if the end of the stream is not reached! Be sure and check the returned value to find out how many bytes were read, if any.
If your method is passed an open stream, don't close it; this is the caller's responsibility.
If your method wraps OutputStream given by the caller in an BufferedOutputStream, you must flush the output before returning, as the caller will have no access to the buffered data.

In the Real World

Most code you come across will use File.separator and File.separatorChar to indicate the file system's separator character for building paths. To get out of the habit of using the java.io.File class, you can use FileSystems.getDefault().getSeparator() instead. Try not to build paths manually anyway; use the appropriate Path constructor or utility method.
Don't use PrintStream unless you can help it; there are better approaches for reading and writing strings, as you will learn in an upcoming lesson.
If you need to support mark/reset but a given InputStream doesn't support mark/reset, find an InputStream that does, such as BufferedInputStream, and wrap the given an InputStream with that.
Copying data using buffers is much more efficient than copying data a byte at a time.

Self Evaluation

If you read a buffer of bytes from an input stream, and then try to print out each byte individually with no conversation, why might some of them be printed as negative numbers? Does this matter?
When would you want to use a BufferedInputStream or a BufferedOutputStream? When would you not want to use these classes?

Task

You are going to create a repository implementation that uses the file system for as its data store, storing publications in individual files. You have not yet learned how to store the individual publications, but prepare for this eventuality by implementing the basic FilePublicationRepository class structure.

The repository will be given a Path during its creation that indicates the directory in which it should store information.
The repository will ensure that the directory exists and is indeed a directory for storing publications using the FilePublicationRepository.
- The validation of the directory will be done in an initialize() method of the repository. It is not a good idea to do I/O operations in the constructor, as they may throw a IOException which is messy to deal with at this point.
- When initialize() is called, verify that the repository directory exists. If the directory does not exist, create it.
- That the directory is appropriate for a file repository will be determined by the presence of a signature.dat file. The file will contain the following bytes: 2, 3, 4, 234, 0, 0, 0, 234. These bytes have no meaning other than to verify the purpose of the directory. You may wish to use the online Hex-works tool, indicated in the Resources section, for producing this file if you have no hex editor.
- When initialize() is called, if the signature file does not exist in the repository directory, create it. If the signature file does exist, verify that it contains exactly the correct bytes; if not, throw an IOException.
- As part of the implementation of signature checking, create a separate method that takes an InputStream and validates that it contains the signature bytes. You will be able to create a unit test for this method and pass in test sequences using a ByteArrayInputStream. Your main signature checking method based upon a file path will delegate to the InputStream version.
- To provide consistency across implementations, you therefore need to put the initialize() method in the PublicationRepository interface, and make clear in its contract that initialize() must be called for each repository implementation after creation. You may want to give this method a default implementation that does nothing to make it easier for implementation that do not need initialization. For completeness you should call initialize() for your SnapshotPublicationRepository as well, even though in this implementation this method does nothing.
- In order to create a robust API, you will want to specify a contract and implementation that detects if PublicationRepository.initialize() is never called before a repository is used, and which throws a IllegalStateException if called more than once for a repository.
Do not actually hook up your FilePublicationRepository to the Booker application, yet; continue using the snapshot repository for now.
In your main Booker application's constructor, create and store a path to a .booker directory in the user's home directory. This will be the main directory Booker users to store its configuration and data.
When the Booker starts running, check to make sure the .booker directory exists; if not, create it.

You may remove the getBinaryByteValues(byte[] bytes) method and related tests from the previous lesson if you no longer need it.

You are not yet able to test your FilePublicationRepository class, with the exception of the signature verification method above based on an InputStream. If you were to create a test directory on your computer and then create unit tests based upon your those directories, the tests would fail when ran on another developer's computer. In an upcoming lesson you will learn how to create tests that can create their own test environment as necessary and therefore will not be tied to the state of a particular machine.

References

Resources

Hex-works: Online hex editor tool.
HxD: Free hex editor and disk editor. (Windows)
Hex Editor Neo: Free hex editor optimized for large files. (Windows)

Faux Pas

Acknowledgments

Some symbols are from Font Awesome by Dave Gandy.

I/O

Goals

Concepts

Language

Library

Lesson

The Evolution of Java I/O

Exceptions

File Systems

Path

Files

Byte Streams

Input Streams

Output Streams

Reading Single Bytes

Try-with-Resources

Mark and Reset

Writing Single Bytes

Reading and Writing Multiple Bytes

File Streams

Buffered Streams

Review

Summary

Gotchas

In the Real World

Self Evaluation

Task

See Also

References

Resources

Acknowledgments

`Path`