Collection Enumeration Loops, Iterators, & Nested Functions

Courtesy of Dr. Dobb's Journal (March 2004)

Yes, all languages do it, but D's approach is different

By Walter Bright and Matthew Wilson

Matthew is the author of the STLSoft (C++) libraries, several D Standard Libraries, and the forthcoming book Imperfect C++ (Addison-Wesley, 2004). He can be reached at matthew@synesis.com.au or http://stlsoft.org/. Walter is the author of the D programming language and the Digital Mars C/C++ compiler. He can be reached at http://www.walterbright.com/.

std.windows.registry

Manipulation of collections is a common operation in software engineering. Indeed, it is hard to conceive of a meaningful program that does not deal with at least one collection—argv in C, lists in Perl and Python, vector and list in C++, and strings in pretty much any language.

Enumeration is the process of sequentially visiting each element in a collection of elements. This works great for arrays. The most common (or, at least, most widely recognized) form of collection is the array, and the usual way of visiting each element of an array is with the for statement. Regardless of language, it is common to see something like Listing One.

But there are many other collection types, including linked lists, trees, associative arrays (maps), hash tables, sets, bags, and deques, to name a few. When you move beyond arrays, the syntactic commonality in the manipulation of collections is reduced, both between collection types in a given language and between the same collection types across languages. For example, in C, you can enumerate an array with Listing Two, whereas enumerating a list (Listing Three) looks quite different. This variation not only makes it difficult to change to a different containment model, but also places a greater burden of learning on you. Some languages, such as Perl and Python, have built-in support for enumerating collection types in a generalized form. Perl has the foreach statement (Listing Four), while Python has its for statement (Listing Five).

STL

For C++, the Standard Template Library (STL) has done an amazing job of homogenizing the syntax of manipulation of disparate collection types, and has done so according to the philosophy of C++—by adding library elements rather than the specification of new language features. STL supports generic programming by defining the concepts of Containers, Iterators, Algorithms, Function Objects, and Adaptors. Iterators navigate over ranges of elements. Containers yield Iterators, via their begin() and end() methods, that define a range of elements that may be enumerated. Algorithms operate on ranges of elements defined by Iterators. Function Objects (also called "Functionals" and "Functors") are types that can act as functions, with or without maintaining state on a per-instance basis, and may be applied to elements within a range by algorithms.

Hence, it is common to see code such as:

SomeContainerType cont;

for_each(cont.begin(), cont.end(), f);

where SomeContainerType can be any type that satisfies the STL Container concept requirements. In fact, it does not even have to do that: All that is required of SomeContainerType is that it has begin() and end() member functions, whose return values are (or can act as) Iterators. SomeContainerType could be a list, vector, or any other STL container type, or it could be a user-defined collection type, such as those provided in the STLSoft libraries (http://stlsoft.org/), an open-source organization that provides freely available STL-like extensions.

Similarly, f, which could be either a function or an instance of a class providing the function call operator operator () can do almost anything, so long as it is compile-compatible with the types manipulated by the Iterators provided by SomeContainerType. For example, if SomeContainerType is std::vector<std::string>, f would manipulate std::string references. Alternatively, if SomeContainerType is WinSTL's winstl::treeview_child_sequence (http://winstl.org/), f would manipulate HTREEITEM values. In either case, the enumeration algorithm for_each is simply a generic template definition, whose instantiating types are selected by the compiler at compile time. This is the basis of STL's generic programming, and it is powerful, flexible, and extensible.

There can be little doubt that STL saved C++ from obsolescence and, in our opinion, has placed it at the forefront of current language technology. However, D is another language that can make this claim. D looks like C and C++, but eliminates features that make programs difficult to write, debug, test, and maintain (see "The D Programming Language," by Walter Bright, DDJ, February 2002).

Enumeration in D

To support a generalized form of programming requires an enumeration construct that can iterate over any collection. D (available at http://www.digitalmars.com/d/) implements its collection enumeration in a novel way—in the guise of the foreach statement. Using foreach to enumerate the contents of an array looks like Listing Six. The body of the foreach statement works equivalently to other loop statements (in C-family languages); the break, continue, goto, and return statements within the body all have the usual meaning.

There is no longer an obvious loop index variable. Indeed, the idea of a loop index variable makes no sense for many kinds of collections. foreach works just as well on associative arrays; see Listing Seven.

Where the novel implementation of D's foreach really comes into its own is in its support for enumeration of any user-defined collection type. Other languages that support enumeration of user-defined collection types in a syntactically standard form usually require the implementation of specific enumerator types, whose instances are returned by the enumerated object.

For example, a COM Automation collection is required to implement the _NewEnum method, which returns an instance implementing the IEnumVARIANT interface. .NET requires that the collection type contains a method GetEnumerator() that returns an instance implementing the IEnumerator interface. In either case, a separate enumerator object must be created to act as a translating intermediary and to maintain the current state of the enumeration. Such intermediaries generally impact on efficiency since the additional level(s) of indirection have a speed cost; the intermediary itself is usually heap allocated and (in the case of COM, at least) requires timely deallocation.

In C++, intermediaries are represented by Iterators, which can be raw pointers or user-defined types. Such types can often be efficient in implementation, and are usually entirely (including member variables) representable on the stack, which means that they can achieve high efficiency in execution, though this is not always the case. However, even where this does hold true, the syntax of enumeration of STL Containers can leave a little to be desired. If you have a suitable function or Function Object or one that is Adaptable, then for_each may be employed. However, where the manipulation of the enumerated items is more complex, you either have to write a custom—and usually one-off—Function Object, or hand-code the enumeration (as in Listing Eight).

In D, none of these issues are relevant. The actual translation of a foreach statement by the compiler depends on the type of collection being enumerated. If it is an array, then the compiler generates a loop somewhat equivalent to the for loop described at the start of the article. However, if the collection is a user-defined type, then the compiler translates the foreach statement into a loop construct and the body of the statement into a delegate (a callback function). (Associative arrays are treated in much the same way as a user-defined collection.) All that is required of the collection type (which can be a heap-based class or stack-based struct) is that it defines an opApply() method (Listing Nine). This method is called in place of the foreach, passing the loop-body delegate, and the enumeration is conducted by the collection, which knows best how to do that.

The opApply() method handles all the details of traversing the data structure, including maintaining any state. The return value from the delegate dg() is used to communicate with the foreach control code. All values other than 0 are reserved by the implementation and represent the language of interaction between the compiler-generated delegate and the compiler-generated foreach handler. This is why the opApply() function must preserve it and return it if it is not zero. Authors of opApply() methods have a responsibility, therefore, to return 0 or delegate result, and not any other value.

This means that D's notion of a collection is extremely flexible. It can be a bonafide container, such as a list, or it can be a type that is not really a collection at all, similar to the STL notion of an input stream being a read-once collection. Furthermore, it is also simple to wrap operating system or framework collection APIs, and present them as first-class enumerable D types. This is the case with the D standard library's Win32 registry module (std.windows.registry); see the accompanying text box entitled "std.windows.registry."

Registry Enumeration

The registry module consists of several cooperating types—Registry, Key, Value, KeyNameSequence, KeySequence, ValueNameSequence, and ValueSequence—that together provide access to, and manipulation of, the registry on Win32 systems. A given registry Key provides access to its child keys and values in the form of instances of KeySequence and ValueSequence, returned via its Keys and Values properties. Listing Ten shows the implementation of the KeySequence class. (The _Reg_* functions are internal functions that map the raw Win32 API to D-friendly signatures, but that otherwise have the semantics of their mapped equivalents.)

There are several points to note:

The delegate's return value is tested and preserved. If it is not zero, enumeration is terminated and the value is returned to the caller unmolested.
There is an optimization in the fact that the maximum subkey name length is obtained outside the enumeration loop. Thus, the string retrieval buffer sName is created once, and the actual name within sName is passed to the Rey.GetKey() method as an array slice (which does not do any allocation).
In this implementation, a new Key instance is created each time through the loop. This is no less efficient than would be the case in other enumeration schemes (via the dereference of an STL Iterator to yield an instance of its value type). However, it is easy to imagine other kinds of collections for which it would be appropriate to do the "expensive" object allocation of an object once, and merely change its state for each enumeration point. Since the enumeration is conducted within the container, whose implementer knows whether such a thing would be sound, this affords an additional opportunity for optimization.
Since changing the reference passed to the delegate is meaningless for the KeySequence class (because a new Key instance is returned for each step in the enumeration), the inout qualifier on the delegate is not required. Contrast this with the tree enumeration example, where the use of inout allows—but does not require—that the reference can be changed; that is, the tree node's value changed. (Whether you would want to change the value of a tree node in the real world is, of course, a matter for your conscience, not to say your job security.)
If the retrieval of a registry key fails due to insufficient access rights (as will happen on just about any NT-family machine), the exception is swallowed, and the enumeration effectively skips that key. Without this, enumeration in many parts of the registry ends in failure. If you need to know about all parts of the registry, regardless of your rights to manipulate them, you can use the Key.KeyNames property, which returns an instance of KeyNameSequence type that enumerates all subkey names.
The opIndex() method facilitates index operator syntax; for example, ValueSequence vs = ...; vs[3];.
The invariant block is used when unit testing is switched on (-unittest to the compiler) to validate the state of the instance during method invocations. It is applied after all constructors, before the destructor, and before and after every method call.

Maintaining Complex Enumeration State

Again, the interesting aspect of this technique is that there doesn't need to be a separate enumerator object to maintain the state from one enumeration point to the next. All state is held by the opApply() function. Since D also supports nested functions (a powerful and much missed feature in C/C++), this represents an extremely powerful mechanism. For example, if the collection needs an arbitrarily large amount of state, the implementation of the enumeration steps can be entirely represented within the opApply() method, and is, therefore, simplified considerably; see Listing Eleven. The state required to visit each TreeEntry is neatly handled on the stack by the recursive nested function traverse(). Summing all the data on the Tree looks like Listing Twelve. If you were to do that via another technique, you would be in for some rather complex code. Naturally, sum could be abstracted in a template, as in Listing Thirteen.

Conclusion

We've seen how the foreach statement provides a uniform structure around which different kinds of enumerations can be supported. No complicated state-preserving iterator or enumerator classes are needed, even for arbitrarily complex element traversal algorithms. Naturally, all this (and more) can be achieved in C++ using the concepts of the STL. (One of us spends a lot of time doing just that!) However, it is clearly significantly easier to make an arbitrary collection enumerable using D's foreach/apply than it is to write an equivalent in C++. If you're not convinced, you're welcome to download the WinSTL registry classes (http://winstl.org/downloads.html) and the source for the std.windows.registry module (http://www.digitalmars.com/d/) and compare them for complexity and readability; they had the same author.

Debate in the D community is in the early stages regarding an equivalent to the STL for D. Unless and until such a library is implemented, foreach will be the primary enumeration mechanism. Only time will tell whether foreach wins out over STL-like techniques in D's generic containment libraries. What is certain is that for the enumeration of nongeneric container (or container-like) types (such as registry keys), foreach will continue to be a first choice mechanism.

DDJ

Listing One

T   ar[]; // Declare the array
for(int i = 0; i < ar.length; ++i)
{
  ar[i]; // Do something with the array element
}

Back to Article

Listing Two

T   ar[100]; // Declare the array
int i;
for(i = 0; i < sizeof(ar) / sizeof(ar[0]); ++i)
{
  ar[i]; // Do something with the array element
}

Back to Article

Listing Three

struct Link
{
  int   value;
  Link  *next;
} *g_head;
struct Link *l;
for(l = g_head; NULL != l; l = l->next)
{
  l->value; // Do something with the list element
}

Back to Article

Listing Four

@allfiles = ...;
foreach $file (@allfiles)
{
  print "File: " . $file . "\n";
}

Back to Article

Listing Five

languages = {}
for language in languages.keys():
    print language

Back to Article

Listing Six

T   ar[]; // Declare the array
foreach(T t; ar)
{
  ar[i]; // Do something with the array element
}

Back to Article

Listing Seven

uint[char[]] aa;    // associative array of uints indexed by a string
 ...
foreach (uint v; aa)
{
  v; // Do something with the value
}
foreach (char[] k; aa.keys)
{
  k; // Do something with the key
}
foreach (uint v, char[] k; aa)
{
  k; // Do something with the key
  v; // Do something with the value
}

Back to Article

Listing Eight

SomeContainerType             cont;
SomeContainerType::iterator   begin = cont.begin();
SomeContainerType::iterator   end   = cont.end();
for(; begin != end; ++begin)
{
  // Do something(s) with the elements
}

Back to Article

Listing Nine

struct IntegerRange
{
  this(int low, int high)
  {
    m_from  = low;
    m_to = high;
  }
  int opApply(int delegate(int value) dg)
  {
    int res;
    for(int i = m_from; i < m_to; ++i)
    {
      if(0 != (res = dg(i)))
      {
        break;
      }
    }
    return res;
  }
}

Back to Article

Listing Ten

public class KeySequence
{
private:
  this(Key key)
  {
    m_key = key;
  }
  invariant
  {
    assert(null !== m_key);
  }
public:
  uint Count()
  {
    return m_key.SubKeyCount();
  }
  Key GetKey(uint index)
  {
    ... // Does something similar to the opApply()
        // method to get hold of the requisite key
  }
  Key opIndex(uint index) // Provides indexing operator []
  {
    return GetKey(index);
  }
public:
  int apply(int delegate(Key key) dg)
  {
    int     result  = 0;
    HKEY    hkey    = m_key.m_hkey;
    DWORD   cSubKeys;
    DWORD   cchSubKeyMaxLen;
    LONG    res     = _Reg_GetNumSubKeys(hkey, cSubKeys, cchSubKeyMaxLen);
    char[]  sName   = new char[1 + cchSubKeyMaxLen];
    for(DWORD index = 0; 0 == result; ++index)
    {
      DWORD   cchName = 1 + cchSubKeyMaxLen;
      LONG    res     = _Reg_EnumKey(hkey, index, sName, cchName);
      if(ERROR_NO_MORE_ITEMS == res)
      {
        // Enumeration complete
      }
      else if(ERROR_SUCCESS == res)
      {
        try
        {
          Key key = m_key.GetKey(sName[0 .. cchName]);
          result = dg(key);
        }
        catch(RegistryException x)
        {
          if(x.Error == ERROR_ACCESS_DENIED)
          {
            // Skip inaccessible keys; they are
            // accessible via the KeyNameSequence
            continue;
          }
          throw x;
        }
      }
      else
      {
        throw new RegistryException("Enumeration incomplete", res);
        break;
      }
    }
    return result;
  }
private:
  Key m_key;
}

Back to Article

Listing Eleven

struct TreeEntry
{
  TreeEntry *m_left;
  TreeEntry *m_right;
  char[]    m_data;
}
struct Tree
{
  TreeEntry *m_root;
  int opApply(int delegate(inout char[] value) dg)
  {
    int traverse(TreeEntry* te)
    {
      int result;
      while(null != te)
      {
        result = dg(te.m_data);
        if(0 != result)
        {
          return result;
        }
        result = traverse(te.m_left);
        if(0 != result)
        {
          return result;
        }
        te = te.m_right;
      }
      return 0;
    }
    return traverse(m_root);
  }
}

Back to Article

Listing Twelve

int sumTree(Tree tree)
{
  int sum = 0;
  foreach(int value; tree)
  {
    sum += value;
  }
  return sum;
}

Back to Article

Listing Thirteen

template sum(R, V)
{
  R sumit(T t)
  {
    R sum;
    foreach (int value; t)
    {
      sum += value;
    }
    return sum;
  }
}
DDJ

Back to Article