Component System Looking for Design Peer-review

Posted 1 Jan 2001 at 01:50 UTC by pphaneuf

XPLC (cross-platform lightweight components) is a component system with goals of providing extensibility and reusability both inside and between applications, while being portable across platforms (and languages) and having the lowest possible overhead.

I am looking for an early peer-review of its design itself.

Initial insights

My idea went through multiple iterations of prototyping and attempts to think outside the box. After some rather complex schemes, it became clear to me that I wasn't going to overturn the whole compiler and linker situation. Better solutions have existed for quite a while (at least 20 years) and are now tried-and-true, but the good old compiler/linker duo is still there, seemingly here to stay. So my thinking worked around this.

To work within these constraints, I tried to abstract what goes on when you link software together. Two main things happen: gathering interface information and actually getting to the code.

In C++, the former is done by the compiler when going through the header files and is codified in the generated assembler as offset constants into virtual method tables and other such things. The latter is done by the linker when resolving symbols.

In a component system, interfaces provide the first, and the service manager the second.

Complexity is to be avoided as much as possible. DCE is a prime example of what happens otherwise: its complexity led to a whole lot of problems, and the result can arguably be called a failure.

Transparent distributed components are explicitly not part of my plans. RPC (remote procedure call) as a means of achieving distributed components never really took off, and even its bigger supporters, such as Microsoft, are phasing it out in favor of more explicit message-oriented communication. See AnoteOnDistributedComputing.

I will cite Mozilla XPCOM and Microsoft COM as influences on my basic design. This article will assume a basic understanding of the common interface-based COM/XPCOM style components.
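
For readers who want a quick refresher, here is what such an interface typically looks like in C++ (the names are invented for illustration and are not part of XPLC):

    #include <ctime>

    // A COM/XPCOM-style interface is a C++ class containing only pure
    // virtual methods. Callers program against the interface pointer
    // and never see the implementation class. Names are illustrative.
    struct IClock {
        virtual long getTime() = 0;   // seconds since the epoch
    };

    // An implementation can live in another module entirely.
    struct SystemClock : public IClock {
        virtual long getTime() { return static_cast<long>(std::time(0)); }
    };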

XPLC Core

The bare essentials of XPLC consist of a small number of interfaces and an even smaller number of components.

There is a single entry point that is not part of a component, the XPLC::getServiceManager() function. This entry point is how XPLC is bootstrapped, and it doesn't need to be called more than once. An example of calling it more than once would be two unrelated pieces of code wishing to use XPLC, say, the main executable code and some library code. The service manager, following the SingletonPattern, is shared among those unrelated pieces of code, providing a rendez-vous point. XPLC-aware code gets the service manager passed to it as part of its initialization process, as it is needed to do pretty much anything.
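
As a rough sketch of how this is meant to be used (only XPLC::getServiceManager() is part of the design; the IServiceManager name and the helper function are assumptions on my part for illustration):

    struct IServiceManager;  // assumed name for the service manager interface

    namespace XPLC {
        IServiceManager* getServiceManager();  // the single non-component entry point
    }

    void initMyApplication(IServiceManager* servmgr);  // hypothetical application code

    int main() {
        // Bootstrap once, then pass the shared service manager along to
        // whatever XPLC-aware code needs it.
        IServiceManager* servmgr = XPLC::getServiceManager();
        initMyApplication(servmgr);
        return 0;
    }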

The service manager itself is a very simple piece of code; its whole purpose is to be a hook for extensions. The most often used method of the service manager is getObject, which gives you the object associated with a UUID.

A UUID (universally unique identifier), for those not familiar with them, is a 128-bit number that is unique across time and space, for all practical purposes. UUIDs were invented for use in DCE, but have since been used in various other environments, like COM (which based its RPC layer on the DCE RPC) and the Linux ext2 filesystem (as a volume identifier).
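
In C++, such an identifier is typically a small struct with the DCE/COM GUID layout; whether XPLC will use exactly this representation is not settled, so take the following only as an illustration:

    // A 128-bit UUID, laid out like a DCE/COM GUID.
    struct UUID {
        unsigned int   data1;     // 32 bits
        unsigned short data2;     // 16 bits
        unsigned short data3;     // 16 bits
        unsigned char  data4[8];  // 64 bits
    };

    // A component or interface identifier is then just a constant
    // (the value below is a meaningless placeholder).
    static const UUID SomeComponent_ID =
        { 0x12345678, 0x1234, 0x1234,
          { 0x12, 0x34, 0x56, 0x78, 0x9a, 0xbc, 0xde, 0xf0 } };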

The service manager does not do the mapping of UUIDs to objects itself, but rather leverages components called service handlers, which can be added to and removed from the service manager. When the getObject method of the service manager is invoked, it iterates over its list of service handlers and invokes their own getObject method, until one returns an object. XPLC has only one default handler, a static mapping of UUIDs to objects, which is used for all components that are not dynamically loaded, such as those provided by XPLC itself and those linked into the main executable.
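
To make the mechanism concrete, here is a sketch of the handler interface and the lookup loop inside the service manager; the names are my assumptions, only the behaviour is taken from the design above:

    struct UUID;     // 128-bit identifier, as described above
    struct IObject;  // root interface

    // A service handler maps UUIDs to objects in whatever way it likes.
    struct IServiceHandler {
        // Return the object for this UUID, or 0 if this handler
        // does not know about it.
        virtual IObject* getObject(const UUID& uuid) = 0;
    };

    // Inside the service manager, getObject boils down to a loop over
    // the registered handlers, stopping at the first one that answers.
    IObject* getObjectFromHandlers(IServiceHandler** handlers, int count,
                                   const UUID& uuid) {
        for (int i = 0; i < count; ++i) {
            IObject* obj = handlers[i]->getObject(uuid);
            if (obj)
                return obj;
        }
        return 0;  // no handler knew this UUID
    }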

I would like to point out that the XPLC core does not read any configuration file or use any environment variable. The core itself is extremely simple, is under complete programmer control, and will not do anything it was not prompted to do (such as file or network access of any kind).

Component Categories

These are central in the extensibility of XPLC applications. The general principle is that a category is a UUID that is a list of other UUIDs rather than directly an object.

Categories are going to be implemented through a category manager component, which is a service handler. When the UUID of a category is asked for, the category manager will build up a category object containing the list of UUIDs.

Categories have a concept of a default component, which can be either the first available component in the list, or a specific component (this is so that it can be made configurable by the user of an application).

Dynamic Loading

There are two ways of doing dynamic loading included with XPLC, but keep in mind that these are not the only two possible ways. They are themselves implemented outside of the XPLC core, using only facilities also available to any other components. For example, dynamic loaders capable of handling Java, Python or Perl instead of dynamically loaded shared libraries would be possible.

The first dynamic loader is a simple component that can load a single DLL (called a shared object in the Unix world, but I will use DLL for this article to avoid confusion with the concept of "objects that are shared") and make its components available when hooked in as a service handler. Its only parameter is the filename of the DLL to load. The DLL will not be unloaded automatically when unused, only when the loader is actually destroyed. In this regard, it is very similar to the explicit dlopen or LoadLibrary interfaces.
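
Since the loader's interface is not settled yet, the following is purely hypothetical, but it illustrates the intended flow: one loader instance per DLL, hooked into the service manager as a service handler:

    struct UUID;
    struct IObject;
    struct IServiceHandler {
        virtual IObject* getObject(const UUID& uuid) = 0;
    };
    struct IServiceManager {
        virtual void addHandler(IServiceHandler* handler) = 0;
    };

    // Assumed interface for the simple loader: it is itself a service
    // handler for everything contained in the one DLL it loads.
    struct IDLLLoader : public IServiceHandler {
        virtual bool loadDLL(const char* filename) = 0;
    };

    void hookPlugin(IServiceManager* servmgr, IDLLLoader* loader) {
        // The loader's only parameter is the DLL filename; once hooked in,
        // the components inside the DLL are reachable through getObject().
        if (loader->loadDLL("plugins/svgalib-backend.so"))
            servmgr->addHandler(loader);
    }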

The second one is more complicated, covering a whole directory. When created, the component will survey the directory it has been given as a parameter to find any DLLs containing components and build up a map of the UUIDs serviced by these DLLs. When asked for a UUID, the dynamic loader will load the required DLL if needed, and will automatically unload it if it is not needed anymore. If possible, this UUID to DLL mapping information can be saved in a cache file, along with the modification time of the directory and of each DLL, so that the cache is properly invalidated if a change occurs.

Note that these two dynamic loaders are ordinary components and are not singletons like the service manager is. In fact, a single instance of the simple dynamic loader can only load a single DLL, so an application wishing to load multiple DLLs would simply create an instance of the simple dynamic loader component per DLL.

There is a potential problem with these. The service manager does not have any specific ordering in its use of service handlers, so prioritizing component modules is not possible with this design. Also, the directory-based dynamic loader could face the problem of having more than one DLL implementing the same UUID (different versions of the same component for example) in the same directory.

Ordering of service handlers within the service manager could be resolved by changing the service manager interface to have appendHandler and prependHandler methods instead of the single addHandler method.
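
The revised interface might look like this (the method names other than getObject are only suggestions):

    struct UUID;
    struct IObject;
    struct IServiceHandler;

    struct IServiceManager {
        // Ask every registered handler, in order, until one answers.
        virtual IObject* getObject(const UUID& uuid) = 0;

        // The caller chooses where the handler goes in the lookup order,
        // instead of a single order-agnostic addHandler().
        virtual void appendHandler(IServiceHandler* handler) = 0;
        virtual void prependHandler(IServiceHandler* handler) = 0;
        virtual void removeHandler(IServiceHandler* handler) = 0;
    };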

Ordering of components within a directory could be resolved by the modification time (the newest component wins). I do not find this solution particularly appealing, but it is the only one I see at the moment.

Versioning

The COM-style interface-based components are a very good basis for versioning, as has been pointed out in ComVsCorba.

I also have the intention of enabling version verification at every possible point of friction, such as the libxplc.so/XPLC.DLL itself, loadable components, and so on. This is important to allow refactoring later on without crash-and-burn as the only "notification".

Comparisons

Here are some "competitors" to XPLC, for comparison purposes:

  • XPCOM

    XPCOM, while being a good thing overall (I'd rather see XPCOM dominate than Microsoft COM), suffered from the pressure of having to release the Mozilla project. A lot of it is thrown together and it seems to lack focus. A lot of things that don't have much to do with components made it into what the Mozilla project calls XPCOM, so things are getting a bit bloated code-wise.

    While they would like it to be used in more places, they are putting their efforts into making it what Mozilla needs first. This is one of the reasons I think something meant to be shared between projects, such as a component system, might be better taken care of by an independent project.

  • CrystalSpace Shared Class Facility

    This project shares some of the goals of XPLC, most prominently being lightweight. It also shares a problem I see with XPCOM: it is part of another project.

I did not mention COM or CORBA on purpose, as they do not share enough of the goals of XPLC to be considered on the same level.

Implementation

Some notes about the implementation of XPLC as it stands now.

  • It is developed using some ExtremeProgramming. For example, the test suite accounts for roughly 30%-35% of the code, and the tests are developed before the components themselves are implemented.

  • It currently weighs in at under 2000 lines of code, and should still be below 4000 lines of code with everything described in this article implemented.

  • The stripped libxplc.so shared library weighs in at 24 kilobytes on Linux/Intel. I would be surprised to see the full implementation of XPLC reach 100 kilobytes.

  • A DLL containing XPLC components does not have to link with libxplc.so/XPLC.DLL.

As you can see, XPLC is trying hard to live up to the "lightweight" part of its name.

Conclusion

Starting from this base, I am confident that XPLC could be used to build complex and extensible systems. One of the things I like most about XPLC is its simplicity, which can then be used to build maintainable complex systems, much like Unix pipes. Unix pipes have been called the only truly successful component system, so I see sharing this important attribute with them as a good sign.

The lack of built-in transparent remoting in XPLC should not be an obstacle to distributed components, because such a remoting or messaging layer could easily be added to it, just like rsh or ssh can be used to make distributed systems out of Unix pipes, without them directly supporting remoting.

Thanks in advance to the Advogato community for your thoughtful comments!


A bit too simple?, posted 1 Jan 2001 at 18:55 UTC by hp » (Master)

It seems like most of what a component architecture should have isn't specified by your article. What about threads (to use threads, you need all your components to be using the same threading abstraction)? How are exceptions handled (C++ or CORBA style)? Do you use an IDL? Do you have dynamic reflection? If you don't have dynamic reflection, you pay in bloat every time you use a non-C++ language, because you have to compile static C++ stubs for every language you use a component from; what's the rationale for bloating and complicating things for non-C++? (Do you intend to support non-C++ languages?) MS .NET also uses dynamic reflection to automatically build a proxy for remote objects; you would need statically compiled stubs for remote objects as well. How are object lifecycles managed (refcounting, GC)? Do you define a fixed ABI or simply use the C++ compiler ABI? Do you support IDEs with "object properties" along the lines of Delphi? Do you have some infrastructure for events/callbacks (signals/slots), or does this have to be constructed ad hoc by each interface? How extensive is your type system - is it very limited like XPCOM, or more general like CORBA? If limited, how do you e.g. pass a rectangle to a method, do you always have to explode the rectangle into primitive types such as int and pass those individually? etc., there are tons of issues to consider ;-) right now it sounds like you basically just have an abstraction around dlopen() for loading C++ classes, I'm not sure that constitutes a component system...

Re: A bit too simple?, posted 1 Jan 2001 at 21:37 UTC by pphaneuf » (Journeyer)

I agree that the article only covered the bare minimum and didn't cover much infrastructure, only the general architecture.

Threads are left unspecified for the moment. I don't wish to encourage a very bad tendency in so-called "modern" software, but at the same time I realize that not a whole lot of people share my view, and that those who don't will have to be taken care of in some way. So some plan for threads definitely has to come up. I was thinking of specifying that, as far as XPLC is concerned, there are only what COM calls "free-threaded components", or something similar. That could very well be construed as a cheap cop-out though (which could be right).

Exceptions are left out, in the name of simplicity. Error handling is expected to be C-style. This too is not very well taken care of, but COM-style HRESULTs for languages that don't support exceptions are like applying razor blades to my eyes (rather unpleasant I might say). Again, this is a controversial area for me.

About IDL: while I do not use it at the moment, I intend to do so at some point. Right now, only C++ is actually supported, in order to get to a feature-complete first release of the core, but the intention is to be cross-language as well as cross-platform, as stated at the beginning of the article. There will be some level of reflection. It might not be as dynamic as I would like (I like Objective-C more than C++, but I'm rather forced into C++), but there will be some reflection. Static stubs for every supported language are definitely out of the question.

There is an interesting point with the IDL and reflection you brought up, in that they would allow the transparent remoting that I said I wouldn't support. A transparent remoting add-on would definitely be possible, but the style of the interfaces will be inhospitable to it. For example, as there are no exceptions, there is no obvious way of reporting remoting errors. The idea is that simplicity is a better feature than transparent remoting. An interface is complete in itself. All you get when calling a method is the return value; everything is obvious and explicit, nothing to forget about. If you ignore a return value, you are doing so knowingly, as it was clear in the method signature that there was a return value. No void-returning methods popping an exception on you. From my experience, handling exceptions is not much less painful than handling return values, and people who don't check return values generally don't handle exceptions either. Since exception escalation (unhandled exceptions bubbling up the call stack) is only guaranteed when a language supporting exceptions is used in the implementation, you're not getting a whole lot in exchange for horrible HRESULT-style exception handling.

I agree it does look a bit unnatural in languages supporting exceptions, but there is a cost to everything; I just made a different compromise. In my defense, I would say that a number of existing C++ libraries avoid exceptions too, and that exceptions are not very well supported across platforms and compilers (for C++). But I also know other languages have much more mature exception support.

Lifecycle is handled by reference counting, with support for weak references (inspired by XPCOM's weak references). I am not too happy about this, as I feel that weak references should actually be the default type of references (strong refs being owning refs, there should be a limited number of these), but they are next to impossible to implement transparently. There is a root interface similar to COM and XPCOM, with the three classic methods to manage the refcount and query other interfaces. I intend to have the C++ binding make this as natural as possible, with smart pointer templates and a type-safe template function wrapper around QueryInterface, for example.
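
To give an idea of what I mean, here is a sketch of that C++ sugar. The root interface with its three methods is part of the design, but the exact method names, the per-interface IID convention and the helper names are assumptions made for the example:

    struct UUID;

    // Root interface: refcounting plus interface discovery.
    struct IObject {
        virtual void addRef() = 0;
        virtual void release() = 0;
        virtual IObject* queryInterface(const UUID& iid) = 0;
    };

    // Type-safe wrapper: "give me the IFoo view of this object, or 0".
    // Assumes each interface exposes a static IID constant.
    template <class Interface>
    Interface* getInterface(IObject* obj) {
        if (!obj)
            return 0;
        return static_cast<Interface*>(obj->queryInterface(Interface::IID));
    }

    // Minimal owning smart pointer that releases its reference on scope exit.
    template <class Interface>
    class ptr {
        Interface* obj;
    public:
        explicit ptr(Interface* aObj = 0): obj(aObj) {}
        ~ptr() { if (obj) obj->release(); }
        Interface* operator->() const { return obj; }
        operator Interface*() const { return obj; }
    private:
        ptr(const ptr&);             // copying left out of this sketch
        ptr& operator=(const ptr&);
    };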

A note there: while a more complex GC system might be seen as easing the programmer's life, I think there is no such thing as a free lunch. If you don't know what you're doing, there is no GC system that will save your ass. People often see refcounting and think that everything is taken care of by the refcounting, but you still need to respect an ownership model, or else you're doomed. One of my friends commented to me that most of the time, when you actually know what you are doing, you don't need any kind of GC (refcounting or other). His opinion is also that a lot of people do stuff without knowing what they're doing, of course. That's his opinion, but I will admit that there is some truth in there (or else, there wouldn't be as many threads in the programs I use daily!)... :-)

The ABI is currently undefined; XPLC simply uses the C++ compiler ABI. Unfortunately, just about only Microsoft can impose a calling ABI on compiler makers, and I don't think a -fxplc-abi flag will be available in GCC anytime soon. I would like to define an ABI, but I doubt this is possible at all. Using the native C++ ABI of a platform (across languages) is about the best we can do, keeping in mind the high-performance requirement of XPLC (imposing a different ABI would most probably require some jumping through hoops).

Delphi is very far back for me, so this is hard to answer; I hope this will suffice: interfaces will have reflection at some point, and interfaces are all there is (there are no "properties" allowed in interfaces, only methods).

There is no specific infrastructure for events/callbacks or signals/slots. This is a basic and simple component system that aims to DoTheSimplestThingThatCouldPossiblyWork. XPLC will not save you from having to read documentation and specifically still requires the programmer to Know What He's Doing (TM).

I have to draw a line somewhere. To some people, a component is something that has an area measured in pixels. Maybe my Perl background is apparent in some places, and that might hurt some Python people's eyes, I'm very sorry. Myself, I have my own One True Way for many things (including a strong opinion on threads and a way of handling events that I think is very good), but I am trying to be realistic here and to understand that imposing my own One True Way on others might be a losing proposition. So instead of doing something I would find of dubious value, I will leave a non-negligible amount of infrastructure that is not 100% essential to others. I will definitely try my hand at this, and XPLC will probably include a set of commonly useful components that follow my ideas, but XPLC itself will not impose such things. I just hope that the included stuff becomes popular.

If I might compare with Bonobo (since that might be more where you're coming from), XPLC would be somewhere between the new GTK+ object system and CORBA I'd say. In the Microsoft world, Bonobo would be something that compares more with OLE, and XPLC with COM itself. I just abstracted away the library linking process and made it much more dynamic, I'm not pretending to go on the level of Bonobo.

The type system is currently "whatever is expressible in C++". That is going to trickle down to some subset of this that will be expressible in IDL. I intend to support structures if possible.
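
To pick up hp's rectangle example: if structures do make it into that subset, the C++ binding could pass them whole instead of exploding them into primitive parameters (the names below are invented for the example):

    struct Rect {
        int x, y, width, height;
    };

    struct ICanvas {
        // The whole structure is passed as one argument (by const
        // reference in the C++ binding) rather than as four ints.
        virtual void fillRect(const Rect& area) = 0;
    };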

There is more than just an abstraction around dlopen(), IMHO. Sharing components between applications and component libraries themselves, transparent extensibility, a calling interface between languages (when the IDL compiler is ready), are things I rate as essential to a successful component system. I agree with Rob Pike's assessment that Unix pipes are pretty much the only component system that was truly successful. And there's a good reason: it's dead simple. To hell with complex type systems and infrastructure. Add a program/script to a directory in the PATH or add a new directory to the PATH, and you've just extended the system. No built-in remoting, but it can be done in multiple ways. There is just a minimum of architecture (processes, pipes, stdin/out/err, environment variables) and it's almost infinitely extensible and readily usable. Also note there are no threads and no "bubbling" exceptions. ;-)

It's just that so many things are optional in XPLC that the core alone might seem like just an abstraction around dlopen(). If you look attentively, the dynamic loaders are not part of the core of XPLC. The XPLC core is just an abstraction around a dynamically extensible symbol table. Everything else is an optional extension. There will be a good number of extensions bundled with XPLC that will make it a serious component system, but if you want to get down with just the core, you effectively have very little. That's designed as a feature. :-)

Thanks a lot for your comments, I am very pleased that someone with a good grasp of the subject took the time to answer.

Threading, posted 2 Jan 2001 at 20:04 UTC by nymia » (Master)

I think threading is one of the most difficult parts of building frameworks. Usually, some objects need to run separately, by having a member function like run() make another member function like looper() run on a thread. Once that is done, message queues and semaphores come into the picture. After that, more things need to be added as well.

Re: Threading, posted 2 Jan 2001 at 20:35 UTC by pphaneuf » (Journeyer)

I might argue that XPLC is not a framework by itself, but more of a base to build frameworks on. Think COM underlying OLE2, or (quite similarly) CORBA underlying Bonobo.

Now, the real question is: should that be common and part of the XPLC layer? Or is the current solution (in COM parlance, XPLC having only one apartment type, the multithreaded apartment) sufficient?

Now, before thinking that providing some protection/separation for threads is needed, check out point 33 ("Beware the Free-Threaded Marshaler (FTM)", page 136) and point 35 ("STAs may need locks too", page 146) in "Effective COM".

The decision of having only the equivalent of the multithreaded apartment in XPLC is part of the high-performance goal, thus avoiding any kind of marshaling. This doesn't preclude having access to threads and all that they entail in XPLC; it just says that if you do use threads, we won't be protecting you from the monsters that go with them (which is probably all that you deserve anyway, IMHO).

Comments from e-mail, posted 2 Jan 2001 at 21:19 UTC by pphaneuf » (Journeyer)

I thought I should copy some comments I had by e-mail from people that do not have Advogato certification, so that they'd be written down along with the article.

Bjorn Reese <breese@mail1.stofanet.dk> wrote:

Hello Pierre,

I do not have sufficient privileges to post articles on Advogato (nor do I care about that), so I am replying by email instead.

First I would like to make you aware of other component models that might be worth investigating.

SOM:
  IBM's System Object Model.
  A good starting point is
    http://www-4.ibm.com/software/ad/som/library/somvscom.html

Bonobo:
  The GNOME component model built on top of CORBA
  http://www.helixcode.com/tech/bonobo.php3

JavaBeans:
  Although married to Java, you might still be able to find interesting things in JavaBeans.

The book "Component Software" (Addison-Wesley, ISBN 0-201-17888-5) by Clemens Szyperski may also be worth looking at.

Component models are essential in today's software, and on the Unix platforms we have been lagging way behind, so it is good to see that something is finally starting to happen. However, we should also try to address known problems with the current component models.

A major problem with binary compatibility is that interfaces may change quite often, so you end up with many versions. Phasing out older versions is seldom popular with your target group (and may even be impossible in commercial settings). An alternative to predefined interfaces is to use tagged parameters. In a traditional interface you define that a given function takes an integer parameter followed by a string parameter. If you add a third parameter, then you have to use different versions of the function. By tagging parameters, more flexibility in the interface is possible. The demarshaller will simply use a default value if the third tag is missing. An example of this is the XML-based SOAP specification.

Renaud Hebert <hebert@bcv01y01.vz.cit.alcatel.fr> wrote:

As I have no account on Advogato.

I've found it quite surprising that you compare your component system only with XPCOM and CrystalSpace's SCF.

What about KParts ? Have you looked at it?
--
Renaud Hebert

More e-mail comments, posted 2 Jan 2001 at 21:21 UTC by pphaneuf » (Journeyer)

Robert Findlay <fcsoft@attcanada.ca> wrote:

You might want to check out the SIMPL project at

http://www.holoweb.net/~simpl/

In particular you might want to view the section on "SoftwareICs"

http://www.holoweb.net/~simpl/simpl_softwareICs.html

where the SIMPL Send/Receive/Reply messaging is used to produce fully encapsulated reusable components that can be written in any language from C/C++ to Tcl/Tk.

Even more e-mail comments!, posted 2 Jan 2001 at 22:30 UTC by pphaneuf » (Journeyer)

David Golden <david.golden@ireland.com> wrote:

I was just wondering how XPLC compared to the Bamboo open-source component framework?
http://www.npsnet.org/~watsen/Bamboo/

Replying to e-mail comments, posted 2 Jan 2001 at 22:58 UTC by pphaneuf » (Journeyer)

SOM, maybe I didn't look hard enough, is mostly a CORBA implementation, isn't it? The linked paper is quite old (1995, DCOM didn't even exist yet and CORBA wasn't quite where it is today).

Note that SOM requires a "Direct-to-SOM" compiler to get compatible code. This must be something similar to getting the compiler to behave according to the COM binary compatibility standard (like Visual C++ does).

SOM is also not open source, so if I am wrong and it actually is something more than an implementation of CORBA, all I can do is pretty much get ideas from it.

Bonobo is interesting in a few ways.

My impression of it is that it is a clone of OLE2 on COMified CORBA. Maybe I'm wrong. I won't comment on whether this is a good idea or a bad idea, as this is not pertinent to the discussion at hand.

The important part is that Bonobo is (mostly, except for its COM look-alike interfaces) orthogonal to XPLC, and that it would maybe be possible to port Bonobo to XPLC. XPLC is more comparable to the CORBA layer that Bonobo sits on.

One of the things I also find interesting is the adoption of OLE2- and COM-style components in Bonobo. I might be wrong, but I remember some GNOME people (miguel himself?) belittling these technologies. I might be remembering that the wrong way though...

JavaBeans I will have to look at again; they had just been created when I started working on XPLC (yes, I have worked on it for that long) and there was little more than marketing drivel available at the time. My understanding is that it's a set of conventions for Java classes that enables Java IDEs to show them in a component-like way to developers, through the use of Java reflection.

I looked at KParts, and it shares a problem that I attributed to both XPCOM and CrystalSpace's SCF: it is only a part of another project. It also seems to focus on IPC a bit, but is seemingly much more efficient at this than other efforts. I think KParts is a very good thing for KDE.

Also, it is more at the level of Bonobo (but implements more of the lower level by itself rather than relying on what I call a component system, like CORBA).

SIMPL is explicitly interprocess messaging, so it is barely comparable with XPLC, which is explicitly in-process. The "software ICs" concept that it talks about could also be implemented with XPLC.

Bamboo is very close to XPLC's goals. Really close. I just downloaded it, and will be looking at it very soon. It seems to include more things than XPLC, looking at the NSPR dependency.

Some parts of its roadmap made me feel that it might be getting weird, notably the part about downloadable modules, and the mention that separation of interface and implementation (which they deem essential for versioning, and which is inspired by XPCOM) is still in the future.

The downloadable modules are an example of what you often get when such a project grows out from within another. Bamboo came from a virtual reality project, which shows in some of its roots. Again, I'm not saying that downloadable modules are bad, just pointing out that getting them is a side effect of being part of another project, which also means that some things not needed by that project might take a long time coming.

Again, feel free to correct me on any of these topics, that's what I am here for!

Yet another e-mail comment, posted 3 Jan 2001 at 05:56 UTC by pphaneuf » (Journeyer)

Rick Parrish <rfmobile@swbell.net> wrote:

Pierre,

Here is something I hope you will consider in designing XPLC. It is an opportunity to avoid a flaw that both MSCOM and XPCOM share. That flaw is aggregation. If you have ever taken a look at the macros that XPCOM uses to deal with both the ordinary and aggregated implementations of nsISupports or the Microsoft MFC and ATL templates for doing likewise in IUnknown then you know how much effort goes into dealing with this issue in both COMs.

There is a much simpler, easier way around this that offers three advantages:

1. all components are implicitly extendable without aggregation - when you implement your component you don't even have to think about whether you want to go through the extra effort to support this.
2. only one interface that supports interface discovery is required.
3. All other interfaces do not (and should not) derive from nsISupports or IUnknown. This makes every interface's vtable smaller than its equivalent counterpart in XPCOM or MSCOM.

How does this work? Easy: when you create a component, you receive a pointer to its interface discovery mechanism, and this interface is the only interface from which you can discover any other interfaces supported by this component. Other interfaces dispensed by this interface are not reference counted - only this one is. As long as you want your object and any other outstanding interfaces to remain valid, hang on to its interface discovery pointer (i.e. QueryInterface).

None of the other interfaces returned by QI should have their own QI so this is the only way a client can get to the interfaces of the component.

The big pain of MSCOM and XPCOM comes from one component that extends another where it must somehow be able to inform the inner component that has multiple interfaces that all implement QI to redirect requests for the "master" interface dispenser back to the containing component. Ugly. By completely hiding where an interface came from this becomes a non-issue. The only code that needs to know where an interface came from is the code that used the component's interface dispenser to retrieve it to begin with.

All a new component needs to do to assume and therefore extend an existing component is this: in the new component's interface-dispensing handler, requests for those interfaces it does not wish to implement are delegated to the internal component. That's it!

Something to think about.

Regards, Rick Parrish

QI'ing leads to DLL hell, posted 3 Jan 2001 at 08:20 UTC by nymia » (Master)

I think a lot has been said about this and we have seen how the use QI () leads to so many problems. IMO, components shouldn't be given the ability to QI() an interface because it leads to tight coupling. Does exposing a member function from an interface really the solution? I don't think so, one only resorts to that method only when one has no choice but C++. (Note that I am bashing C++ but only pointing out the situation where one is forced to come with a solution using one language as a reference.)

I hit the reply button too early, posted 3 Jan 2001 at 08:26 UTC by nymia » (Master)

Correction:

I think a lot has been said about this and we have seen how the use of QI () leads to so many problems. IMO, components shouldn't be given the ability to QI() an interface because it leads to tight coupling. Does exposing a member function from an interface really the solution? I don't think so, one only resorts to that method only when one has no choice but C++. (Note that I am not bashing C++ but only pointing out the situation where one is forced to come with a solution using one language as a reference.)

Re: QI'ing leads to DLL hell, posted 3 Jan 2001 at 18:40 UTC by pphaneuf » (Journeyer)

nymia: wasn't that the same thing twice? What did you change?

I am of quite a different opinion here. One of the goals of XPLC and of QI() is precisely to avoid DLL hell. For example, in Quadra, we have both Svgalib and Xlib back-ends, and thus we had to link against both. Result: Quadra wouldn't run if Svgalib wasn't installed, even if we'd never call into it even once!

Now, we'll have a component module link against Svgalib and another link against Xlib, and whatever we can load will be what is available. If Svgalib is missing, the dlopen() will fail and Quadra will still run anyway.

Again, regarding QI() itself, this is a way to avoid tight coupling. You get an object, you QI() it to something you know, and if it doesn't support it, you pass. What "you pass" means depends on the situation, of course, but it isn't any worse than a message not being understood by the receiver object, for example. In fact, you at least know that the object doesn't support the interface, and you can take special action instead, for example.

Another example where QI() avoids tight coupling: when you want to implement a feature that is orthogonal to the main goal of a component, for example persistence. Instead of requiring that a class inherits from a persistence protocol (interface), which I find a pretty tight coupling, you can dynamically check for persistence support in a looser manner.
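
A small sketch of what I mean, with invented interface names (only the QueryInterface pattern itself is the point):

    struct UUID { unsigned int data1; unsigned short data2, data3; unsigned char data4[8]; };

    // Illustrative placeholder value for the persistence interface's UUID.
    static const UUID IID_IPersistent =
        { 0x00000000, 0x0000, 0x0000, { 0, 0, 0, 0, 0, 0, 0, 1 } };

    struct IObject {
        virtual void addRef() = 0;
        virtual void release() = 0;
        virtual IObject* queryInterface(const UUID& iid) = 0;
    };

    struct IPersistent : public IObject {
        virtual void saveTo(const char* filename) = 0;
    };

    void maybeSave(IObject* component, const char* filename) {
        // Ask the component whether it supports persistence at all.
        IPersistent* p =
            static_cast<IPersistent*>(component->queryInterface(IID_IPersistent));
        if (!p)
            return;           // not persistent: we simply "pass"
        p->saveTo(filename);  // persistent: use the optional interface
        p->release();         // drop the reference that QI gave us
    }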

Of course, the downside to all these dynamic features is that they can also fail dynamically (the case where a component doesn't support a needed interface). But you have this problem everywhere you have this feature. In some languages, like Java, you don't QI(), but the Java runtime does the equivalent for you, without telling, and will throw an exception if a method is not found or something like that, which isn't better.

In fact, I like the explicit way better, because you can point at a QI() that doesn't check for NULL and say "see? you've got a bug here", whereas the Java runtime could be doing one of these dynamic queries at a point where things look safe, and throw up on you (hmm, nice analogy!).

Curious about your goals, posted 3 Jan 2001 at 20:57 UTC by sab39 » (Master)

(Note: I'm not claiming to have any particular insight into the issues you will face. I'm not offering "design peer review" because I wouldn't consider myself a "peer" of any of the people who have responded so far. I'm simply a curious onlooker...)

Why did you choose the particular goals you did? As I understand it, the goals seem to be more or less: Simple, lightweight, and fast.

While all of these things are admirable, I'm not sure that any of them will prove compelling reasons for anyone to actually *use* XPLC. COM, CORBA, XPCOM, Bonobo, KParts, and whatever OpenOffice uses are already in wide use in existing projects. You also mention a bunch of other technologies, which are also presumably already being used in the "real world".

Now, of course you should hack on whatever *you* want to hack on, and questions of "whether the world needs another component architecture" shouldn't discourage you from pursuing your own personal goals. However, if you want to persuade users of other component systems to switch to yours, you will need to provide compelling benefits over what they have. Speed and lightweightness seem not to be important goals of many projects these days, and simplicity is only good if you aren't cutting important functionality for it.

So far in your postings, it seems that your goal of simplicity has led you to explicitly exclude features that are in these other implementations (particularly exceptions, which, while unnatural and ugly in C, are essential to the point that it would be unnatural to try to code without them in many other languages, and threading, which for good or bad is widely used). You also don't mention out-of-process (but non-distributed) calls.

It seems to me that, to be compelling enough to persuade anyone to actually switch, you should aim to provide at least *some* way of dealing with issues that other systems deal with. Perhaps the single most compelling feature you could provide would be transparent interoperability with all the other systems (which I know is hard - that's what would make it compelling :) ), but that directly conflicts with performance and simplicity.

So I guess my question is, why did you choose to pursue the particular goals that you did, as opposed to the almost-opposing set of goals I have outlined here?

(By the way, I still think you should hack on whatever *you* want to, not what *I* happen to think the world "needs"...)

Re: Curious about your goals, posted 3 Jan 2001 at 22:26 UTC by pphaneuf » (Journeyer)

A few reasons...

First, there is no real component system for most Unix systems (other than the Unix pipes, that is). CORBA is being bandied about, but... (next point)

Second, all those other component systems are not in such wide use. They might seem to be widely used, but only Microsoft COM really is, and only on Windows!

The reason I think this is the case is that the barriers to entry in the component world are many: some have too much overhead and are too costly, others are too complicated to use or to code components into, etc...

COM got performance sorted out not too badly, as long as it is in-process and no marshalling occurs. But it's awfully complex to code in (do you speak Hungarian? I don't). It's also not portable.

Speed and lightweightness not important goals? Ok, do a CORBA wrapper to GTK+ and try to use that to build an application with any kind of usability. XPLC will allow something like this, which could enable swappable GUI toolkits for example (note that I said "enable", I didn't say that I'd do it or that it will even happen!). :-)

I would say that for components to be pervasive, the cost associated with them should be as low as possible, both in developer time and in runtime overhead. Thus these straightforward goals, to achieve pervasiveness by practicality and pragmatism rather than with creeping featurism.

As for too much simplicity forgoing some important features, think about this: C and C++ are probably the two most popular languages out there, and COM exceptions map to an ugly mess in both of them, even if C++ has neat native exception handling (which is broken on a significant number of platforms).

Threading is allowed in XPLC, just not promoted (which I see as a feature, but I agree that I might not be in the "winning" camp on that subject).

Out-of-process calls are not in XPLC by choice. The idea is for the per-call XPLC overhead to be known, bounded and small. The idea is that out-of-process communication is possible, but will be explicit in some manner, again by choice. You'll know that you are putting your feet in the mud when you actually do that; you won't get stuck without your knowledge (mess around with the free-threaded marshaler in COM, and you'll see what I mean).

I think that CORBA and others could easily live beside XPLC, but I target XPLC as something that would be as pervasive as the Linux ld.so. I would like binary packages to be able to support everything I have, but still work if some optional library is missing from my system. Right now, autoconf auto-detects, or lets me choose, the things that will be built or not depending on what I have on my system; then sending the result to someone who doesn't have those things doesn't work, and so on.

You don't want CORBA to solve such a small-scale problem! Imagine your web browser making an out-of-process call to the PNG library for every image row it decodes!

But on the other hand, I have a feeling that bigger things could be built with a system such as XPLC, scratching into the lower end of what is currently done with CORBA, such as the GNOME Panel for example (and many other GNOME desktop-oriented functionalities).

I don't see XPLC displacing a lot of the current component systems, I mostly intend to pick the huge slack that they leave.

I guess that the answer to your last question (why did I choose this approach) is that I don't want complicated server-oriented distributed objects; I would like everything to be configurable at runtime. That means many things have to be components, including the file I/O used by grep, so that I can add support for URLs as files at runtime. You see where this is going?

Is the simplicity there?, posted 3 Jan 2001 at 23:33 UTC by Malx » (Journeyer)

First of all:

1) Why have you chosen the MS way - the way of static class-based objects?
Look at JavaScript object model (property based):
http://developer.netscape.com/docs/manuals/js/client/jsguide/obj2.htm

2) Also, you have mentioned pipes... Look at them - you could use them with any language (same as the ENVIRONMENT of the OS :). They are outside of any language - they are used for interprocess communication.
Then why throw this away? It is simplicity you are trying to extend.
You could use the same pipes with text fields or, better, messages (they have data integrity). All you need is type converters. Something like
"imgcat *.jpg | grep size=32 | move to/dir/"
Here "imgcat" would compose an image object with the needed parameters - size/name/bpp/format/text-info etc. (but never include the image data itself in this!!! just links to the FS or URLs). If you intend to work on the image content, look for a program in GNOME for PIPE-like editing/modifying of images - ImageShaker.

I'm not an expert in any way, but I think that COM emulation is the same thing as building an MS-like desktop into (in place of) the Unix world :(

Re: Curious about your goals, posted 4 Jan 2001 at 06:46 UTC by shalabh » (Journeyer)

sab39, I do think there is a point in having a bare-minimum lightweight component system. I work in a products company - there are many places where we would like to have a component architecture. However, existing systems (COM, CORBA, XPCOM) come with *way* too much overhead and lots of functionality we just don't need. Options are to either do our own tiny component-like system (and lose time) or not use the architecture at all (and lose extensibility). Mostly we opt to save time.

pphaneuf, something like XPLC would be great for a lot of scenarios. I support your idea of enforcing a minimum and leaving the rest to the implementors. In the same philosophy, IMO, you could review the current design of XPLC in the light of feedback and see whether even the things that *are* enforced are not too much for the users.

Maybe interfaces should not have QI(), posted 4 Jan 2001 at 07:44 UTC by pphaneuf » (Journeyer)

I am discussing this idea by e-mail with Rick Parrish, and I think it is an interesting one, with good advantages but also some downsides.

The idea is that interfaces should not inherit from the IUnknown interface. Every component should support the IUnknown interface, but all refcounting and QI() would be done on that interface pointer.

The advantages are that you can implement aggregation much more easily, since you have only one QI() implementation per actual physical component. All the rules of COM transitivity don't apply anymore, because doing a QI() is one-way (if you QI an IFoo interface from an IUnknown interface, you cannot use the IFoo pointer to get another interface or to get the IUnknown back).

You can also share a single IUnknown implementation more easily with a mix-in class in C++, and have a single AddRef/Release/QueryInterface implementation in a library (if the QueryInterface is table-based), thus saving on code bloat.
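
As a rough illustration of that table-based idea (all the names are invented, this is not a settled XPLC API): a shared mix-in owns the refcount and a small table mapping interface IDs to interface pointers, so each component only has to fill in its table:

    #include <cstring>

    struct UUID {
        unsigned int   data1;
        unsigned short data2, data3;
        unsigned char  data4[8];
    };

    inline bool sameUUID(const UUID& a, const UUID& b) {
        return std::memcmp(&a, &b, sizeof(UUID)) == 0;
    }

    class InterfaceDispenser {
        enum { MAXINTERFACES = 8 };
        const UUID* iids[MAXINTERFACES];
        void*       ptrs[MAXINTERFACES];
        int count;
        int refcount;
    public:
        InterfaceDispenser(): count(0), refcount(1) {}
        virtual ~InterfaceDispenser() {}

        // The concrete component registers each interface it implements,
        // typically from its constructor (the UUIDs must be constants
        // that outlive the component).
        void registerInterface(const UUID& iid, void* iface) {
            if (count < MAXINTERFACES) {
                iids[count] = &iid;
                ptrs[count] = iface;
                ++count;
            }
        }

        // Refcounting lives only on the dispenser, as in Rick's scheme.
        void addRef() { ++refcount; }
        void release() { if (--refcount == 0) delete this; }

        // One QueryInterface implementation shared by every component;
        // the returned interfaces are not themselves refcounted.
        void* queryInterface(const UUID& iid) {
            for (int i = 0; i < count; ++i)
                if (sameUUID(*iids[i], iid))
                    return ptrs[i];
            return 0;
        }
    };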

The main disadvantage is that QI() is one-way only. If you get passed an IFoo, there is no way to know if the component also supports IFoo2 or IBar. You have to keep the IUnknown pointer around as a kind of object handle, with which you don't do actual work, but which you need in order to manage the refcount and obtain the other interfaces with which you do the actual work.

You also can't keep a reference to a non-IUnknown interface if you didn't get the IUnknown interface that goes with it, because you can't addRef the object through the non-IUnknown interface, and keeping it might be risky.

The idea is very interesting, but I think that losing the ability to do QI from any interface is a big cost. I feel this is one of the critical aspects of extensibility, where an API that once did only so much can be extended in the future and be dynamically used if available. Applying Rick's idea would mean that we wouldn't pass non-IUnknown pointers around, only IUnknown pointers, which means that every single method that wants to use the object will have to do a QI before doing so. This is losing some of the simplicity of using components.

Re: Is the simplicity there?, posted 4 Jan 2001 at 08:03 UTC by pphaneuf » (Journeyer)

Malx, I understand your feeling that I am doing a "COM emulation" (but look at Bonobo if you want real Microsoft-emulation, IMHO) and that there are better object models.

I would love to support a better, cleaner object model, something like what you linked at or something like Objective-C or Smalltalk, but the truth is that I have to support languages like C and C++, or even assembly, and that I have to do so with the smallest overhead for them, so that going from regular C/C++/assembly non-component code to XPLC components is as easy and enticing as possible.

If they had to go through some dynamic method dispatcher, even a relatively efficient one, it would be very hard to get into the performance range that C++-style virtual method tables give you. They're about the cheapest possible way to implement dynamic dispatching.

Pipes are interesting, but I am thinking of other levels of components where pipes would be inappropriate, like GUIs and widget sets for example. By the way, I am a great fan of the netpbm package; it is a great component library, and it manages to be fast enough even though it passes the image data itself (all of it often has to be read anyway; maybe avoiding the filesystem speeds things up?).

I feel that the very basic idea behind COM is one of the very few really good ideas that came from Microsoft. That, and the combo box, to me, are two great inventions. Let's face it, there are so many people at Microsoft, somebody ought to have a good idea once in a while, eh? :-)

And while I think it is a good idea, I am anything but a Microsoft lackey (ask any of my friends, I do not use Windows without death threats), and I don't want XPLC to just be monkeying Microsoft COM, like maybe XPCOM is doing. I look at MS COM and other component systems, and where I see good ideas that I could use, I will use them. I try to be pragmatic.

Re: Curious about your goals, posted 4 Jan 2001 at 08:11 UTC by pphaneuf » (Journeyer)

shalabh: another thing is that when you do your own tiny component-like system (like we could perceive Mozilla and CrystalSpace have done), you're not compatible with anything, you can't use other programs' components; you're in your own little world.

You say that you opt to save time, and that is a shame in the end, because you end up with the vast majority of programs not being components or component-using. You are right; this is exactly what I have my eye on.

In your last sentence, are you saying that you would try to see if even more things should be optional? Do you see anything yourself that you think could be made optional? The way I see this, there's only one component that you have to use: the service manager.

About KParts and XParts, posted 4 Jan 2001 at 09:21 UTC by pphaneuf » (Journeyer)

I took a look at both of these technologies. I took my information from here and there.

The first thing I notice is that while KDE used to be CORBA-everything like GNOME is, they seem to have realized that at a certain level, CORBA is overkill and just plain overhead. KParts is out to avoid CORBA completely in the most common embedding cases, DCOP to provide a simple IPC mechanism, and so on. They note that using the much more lightweight KParts instead of CORBA allows them to be much more pervasive in embedding components into applications, which is in line with my ideas.

KParts is all in-process, just like XPLC, but is not very general; it seems to be oriented toward visual/GUI components rather than general components. Still, I find this rather nice, because in-process is good for visual components in many cases. But KParts is only very slightly related to XPLC, from what I gather, and the KParts concept could be implemented over XPLC.

XParts adds out-of-process components to KParts, in a quite general way (in the KParts framework). You can use KParts that are in another process, or use anything that can embed itself in an X window (this makes me think a lot of miguel's initial presentation on Bonobo, the "Unix sucks" one (I think he's right about the title, but come on, CORBA to replace Unix pipes?)). Very nice, but only relevant in a visual environment.

As I intend to do something similar to Medusa using XPLC, being restricted to visual components is not very good.

As a side-note, XParts is an example of adding out-of-process to a component system that has been designed for in-process only. Some techniques of XParts could be carried over to adding out-of-process objects to XPLC (which I'll leave out of the core, of course, but would be available).

Re: Curious about your goals, posted 4 Jan 2001 at 13:49 UTC by shalabh » (Journeyer)

pphaneuf: IMO, XPCOM is today not really 'tiny' :-) But it does enforce a lot of mechanisms that a developer may not want to conform to. Which is why developing a system using XPCOM requires a lot more initial effort on the part of the developer than would be required if the developer chose just plain C++.

Keeping just one component (the service manager) essential is a good thing - anything less than that would obviously be nothing at all. What I meant was that you could review if it is possible to implement other features on this component system without changing the core. Things I see people mention like reflection, out-of-process, etc. As I'm not a component system guru I wouldn't be able to tell you what all I would want when I become one :-)

I believe (rather, hope) that the XPLC core will remain a tiny bare-minimum one. So even though after a while there might be a lot of components - support components (registry etc.) - the developer would still be able to use only the parts of the support system that he wants.

how much have you looked at CORBA ?, posted 4 Jan 2001 at 15:41 UTC by stefan » (Master)

From a look at the (sparse) outline of the ideas you want to implement, I have difficulty understanding what exactly you want to support and what you want to leave to a higher-level layer. In fact, it appears XPLC is heavily underspecified. As others already pointed out, it currently appears to be just a complicated way to call dlopen(). If all you want is a modular system, i.e. one that lets you load different implementations of an interface dynamically, there are plenty of (Open Source) projects that do just that. There is little need to provide an (even lightweight) framework to do just that.
However, things get more complex if you want to add some meta data (a whole type system even), or provide some level of concurrent access, or language/location/platform transparency.

You do have to be explicit about stuff like concurrency, or language independence. I.e., you need to specify how your services will act under parallel access (distributed or not). You do need to specify some form of interface in a neutral language (IDL) if you want the components to be implementable in different languages, and of course you need to talk about an adapter mechanism (marshalling) which allows different memory layouts to be bound together.

Given that, I'd really suggest you start reading some CORBA specs, and you give precise arguments against them, instead of just lamenting that CORBA is overly complex. I'm not claiming that CORBA is a silver bullet, but it does solve a lot of common problems nicely. I use CORBA heavily, especially in a colocated setting (hundreds, if not thousands, of CORBA objects in the same address space), where speed really matters. And indeed - in contrast to your statement above - colocated method invocations are little more than virtual method calls in C++.

You shouldn't look at Bonobo or KDE to learn about CORBA. Both seem to suffer from some heavy misunderstanding of the CORBA object model. While Bonobo tries to reimplement DCOM on top of CORBA, KDE made a big mistake when dropping CORBA in favor of its own (C++ and in-process only) replacement, because it abused CORBA and consequently suffered from efficiency problems.

It's sad to see the same pattern as with C++: instead of trying to grasp the technology, people complain that it's overly complex or, worse, that it's 'badly designed' (just demonstrating their ignorance) and then switch to a much less powerful alternative, essentially (badly) reimplementing the same ideas (OO in C for example).

Again: I'm not claiming that CORBA provides necessarily all you need. But I suggest that you study the CORBA architecture (the general ideas and techniques), and then provide some substance when complaining, instead of asking the whole world to read your very vague proposal and to comment. It's just the way it works: if you think you can do things better, you have to do the first steps, and provide some meat in order to tease the people.

Comments, posted 4 Jan 2001 at 17:37 UTC by nymia » (Master)

Here are some of what I think XPLC will need to address and state its position:

  • Threads - Solaris, Linux, Windows, BeOS
  • Process API - fork(), exec*(), dlopen()
  • IPC Facilities - pipes, sema4s, mutexes, message queues, shared mem, etc
  • Sockets
  • File System API
  • Object Brokers - orb, poa
  • Name Service
  • Interceptors - COM+, CORBA
  • IDL
  • QI Properties - reflexivity, symmetry, transitivity

The first five items are already given, as they are provided by the OS; deriving from them is a no-brainer. The remaining items are where XPLC will definitely have to state whether it provides them or not.

Component system complexity, posted 4 Jan 2001 at 19:12 UTC by apenwarr » (Master)

stefan: what does CORBA buy you, if all components run in the same address space? As far as I can tell, IDL is supposed to make the interface transparent (ie. no need for dlopen() and similar messes), but C++ and normal shared libraries do that anyhow. People use CORBA for cross-language, interprocess, and distributed programming, but if your ORB implements those features, then it seems to me that it really does require a lot of complexity. As far as I can tell, the cases where CORBA can be fast are exactly the cases where it doesn't buy you anything. Am I missing something?

Others:

In my opinion, the number one reason that people don't use any given system (whether a component system or whatever) is complexity of either the API or the implementation. If it's too hard to learn or too much work to use, then it falls by the wayside.

API (Interface) complexity: People like to use files. open, read/write, close, and you're done. On the other hand, people don't really like to write sockets programs in C: just to start, you need to call socket(), inet_*(), maybe gethostbyname(), one of bind, listen, or connect, and then finally you can read/write and close. Most of us here probably know how to do all that, but I bet almost all of us have some kind of function library to wrap around it, such as 'int sock = tcp_open("www.slashdot.org", 80)' or some such thing. I certainly do. Notice how so many people think Java makes network programming so much easier -- well, it does. It was never really hard, of course, but the API made it seem hard.

Implementation complexity: I'm sure anyone here can think of a library they refused to use because it would more than double the size of their program. Nowadays, statically linking my "ls" program is a bad idea because glibc is too complex.

Component systems suffer from the same two types of complexity, and IMHO that's why they've never caught on. I don't know much about COM, but XPCOM has so much overhead just to register a C or C++ component that there's no way I would ever use it. ld.so, however, makes it easy; I just include the right header and link with the library (or use dlopen(), of course, but that makes it too difficult so almost nobody does that).

CORBA and IDL solve the interface complexity issue beautifully -- in the C++ case, the IDL compiler just generates a nice header file for you, you fill in the contents of each function, and the data all goes to the right places. #include the header file and link the right libraries to use your component.

However, in my opinion CORBA fails in terms of implementation complexity. There is no such thing as a "simple, fast, stable CORBA ORB", except as compared to the bigger, slower, buggier, more featureful ones. With the huge number of CORBA ORBs available, and all of them still too big, I have to assume it's a fundamental design problem, not the implementors' fault.

You can work around the slowness and complexity of CORBA by having fewer, "bigger" function calls in your object's interface -- the Berlin project does this with great success. And of course, any real distributed system, CORBA or not, has to do the same thing because otherwise latency will kill you (as it does with remote X11).
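To make the "fewer, bigger calls" idea concrete, here is a toy C++ contrast (the interfaces are invented for illustration; they are not Berlin's actual ones): fetching a raster one pixel per call versus one region per call. The second form pays the per-call (or per-message) overhead once per region instead of once per pixel, which is what keeps latency from dominating.

    struct Rect { int x, y, width, height; };

    // Fine-grained: one virtual call (or one round trip, if remote) per pixel.
    class FineGrainedRaster {
    public:
        virtual ~FineGrainedRaster() {}
        virtual unsigned long getPixel(int x, int y) = 0;
        virtual void setPixel(int x, int y, unsigned long color) = 0;
    };

    // Coarse-grained: a whole rectangle of pixels moves in a single call.
    class CoarseGrainedRaster {
    public:
        virtual ~CoarseGrainedRaster() {}
        virtual void readRegion(const Rect &r, unsigned long *buffer) = 0;
        virtual void writeRegion(const Rect &r, const unsigned long *buffer) = 0;
    };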

The ideal component architecture, which might not exist yet, has both a simple, easy interface (like modern ELF shared libraries) and a simple, lightweight implementation that I won't be afraid to link with my programs. Hint: if I have to do anything like queryInterface(), dlopen(), or typecast objects from a strange base class, it's too complicated. Is KParts close to this?

One last thought:

If your program is linked with svgalib and Xlib, and you want it to run even if one of the libraries is unavailable, what's the minimum set of changes necessary to make it work? I bet the problem would be 99% solved if ld.so just allowed the program to execute until one of the missing functions was called, or even did a default "return -1" or something for a missing function. Do I really need the whole mess of dlopen() for each and every function in svgalib?
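For comparison, here is the kind of boilerplate dlopen() demands today, sketched for a single svgalib entry point (the function name vga_init() is real; the wrapper is illustrative). Real code has to repeat the dlsym() dance for every function it might call:

    #include <dlfcn.h>
    #include <stdio.h>

    typedef int (*vga_init_fn)(void);

    /* Try to initialize svgalib if it is present; return -1 if it is not. */
    int try_vga_init(void)
    {
        void *handle = dlopen("libvga.so", RTLD_NOW);
        if (!handle) {
            fprintf(stderr, "svgalib not available: %s\n", dlerror());
            return -1;
        }

        vga_init_fn vga_init = (vga_init_fn) dlsym(handle, "vga_init");
        if (!vga_init) {
            fprintf(stderr, "missing symbol: %s\n", dlerror());
            dlclose(handle);
            return -1;
        }
        return vga_init();
    }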

Re: Comments, posted 4 Jan 2001 at 19:56 UTC by pphaneuf » (Journeyer)

As you mention, the first five services are already provided by the OS. XPLC does not promise to be a "portable platform", only to be cross-platform itself, so initially at least, it will not come with componentized abstractions of OS-dependent features like these; applications will still have to take care of their own portability. XPLC just promises not to get too much in the way itself.

Note that I said "initially". I would like to have a package of basic XPLC components that could be relied on across platforms.

Now, I'll state my position on the remaining items:

  • Object Brokers - orb, poa

    It has neither an ORB nor a POA. Since there is no remoting, there is no need for an ORB, and methods are invoked C++ virtual method call style, more directly than through a POA.

  • Name Service

    The service manager acts as a general naming service. In comparison to CORBA, the UUIDs used by XPLC as names are guaranteed to be unique across time and space. For those thinking that name collisions don't happen that often, I ran into one just today while checking whether there was an Advogato project page for Medusa, the Python single-threaded multi-protocol server (used by Zope).

  • Interceptors - COM+, CORBA

    Not supported; method calls are as direct as possible, with constant (O(1)) overhead.

  • IDL

    I intend to have an IDL at some point, to allow scripting languages to call interfaces and implement interfaces (which will require some type information). At the moment there is none, for the sake of getting somewhere (just as XPCOM didn't have an IDL at first). Fairly soon, I will try to implement at least a minimal IDL compiler with a C++ header backend, so that we can start writing IDL instead of C++ headers for interfaces.

  • QI Properties - reflexivity, symmetry, transitivity

    The XPLC IObject::getInterface() method is required to support this classic trio of requirements, just like MS COM and XPCOM (see the sketch right after this list for how it fits together with the service manager).
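To make the above concrete, here is a minimal sketch of how these pieces are meant to fit together. Only the overall pattern (getObject() on the service manager, then getInterface() on the result, with UUIDs as names) comes from the design described in this article; the IClock interface, the identifiers and the exact signatures are invented for illustration and are not the real XPLC headers:

    struct UUID { unsigned char bytes[16]; };

    class IObject {
    public:
        // getInterface must be reflexive, symmetric and transitive, as noted above.
        virtual IObject *getInterface(const UUID &iid) = 0;
    };

    class IServiceManager : public IObject {
    public:
        // Iterates over the registered service handlers until one returns an object.
        virtual IObject *getObject(const UUID &cid) = 0;
    };

    // A hypothetical component interface, purely for the example.
    class IClock : public IObject {
    public:
        virtual long getTime() = 0;
    };

    // Placeholder identifiers; real UUIDs would be generated, not written by hand.
    const UUID CID_Clock = {{0x01}};
    const UUID IID_IClock = {{0x02}};

    long currentTime(IServiceManager *servmgr)
    {
        IObject *obj = servmgr->getObject(CID_Clock);
        if (!obj)
            return -1;

        // A plain C++ virtual call; no ORB or POA anywhere in the path.
        IClock *clock = static_cast<IClock *>(obj->getInterface(IID_IClock));
        return clock ? clock->getTime() : -1;
    }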

Again, this isn't set in stone; discussing this kind of stuff is exactly what I am here for. The plan is either to have XPLC killed because something better already exists, or for XPLC to survive with a stronger design than it first had.

Re: Comments, posted 4 Jan 2001 at 20:38 UTC by nymia » (Master)

Thanks for replying, I'm sure XPLC will move forward and get to its destination. That's one reason we are all here: to scratch our own itch and be happy with what we're seeing here. It's true, we're often not even aware that we are contributing something to this big and mysterious world of Free Software and Open Source. And we're seeing the fruits of it on a worldwide scale. Look around: we're changing our own lives and changing the lives of others as well. I think that is good!

Re: Component system complexity, posted 4 Jan 2001 at 22:05 UTC by stefan » (Master)

apenwarr: yes, you are missing something: the duality of transparency and sound design. To be more explicit: CORBA provides all the means to be location and language transparent. That doesn't mean I have to run each object in its own address space. Quite on the contrary. It means that I don't need to care. However, it is clear that for the whole to be efficient, I want to cluster objects together in a suitable way to provide fast interaction among objects that are tightly coupled together. In the berlin project that means for example that I will do everything to keep the scene graph nodes in the server, as all the traversals will run so much faster when no marshalling is required. But I can plug in a client side Graphic, if I need to. Language transparency in the context of berlin means that the protocol doesn't mandate the granularity of distributed method invocations (in contrast to, say, X). The fact that I can do all that, even dynamically at run time (lifecycle management, load balancing, etc.) shows what location transparency is all about.
An argument I like to use when explaining this is the analogy with physical vs. virtual memory. At some point people used to address memory directly. The step towards virtual memory with all the complexity involved (memory management, i.e. paging, memory protection, etc.) is IMO quite similar to the evolution we are seeing now in terms of giving up control over the physical location of objects. The analogy is quite far reaching. Even though you don't usually think about physical addressing, you may care about memory layout to make your programs more efficient w.r.t. caching. Similar arguments apply to object clustering in a distributed environment.

And yes, CORBA is pretty complex. I'm not arguing about that. I'm arguing against the myth that CORBA is slow.
The complexity in CORBA is just a mirror of the complexity of the problem domain CORBA deals with. And if you strip off most of the problems, there isn't much left to be solved in terms of a middleware framework. The fact that the complexity of CORBA shines through in C++ is something you might consider an advantage or a disadvantage, depending on your point of view. If you find it inconvenient, I suggest you have a look at some berlin demo applets I wrote in Python. It's really sweet.

Back to the point: even if distribution is not an issue, there is still language transparency. You do need some form of interface definition, together with language mappings, as well as an equivalent of GIOP. (Hoping that all languages will eventually agree on a common ABI is just an illusion.)
If you restrict yourself to in-process components, and you don't want to deal with a mess of (one-to-one) language adapters (such as Python <-> C/C++), you are bound to a single language. If even that isn't important, you are really left with a way to specify ('export') an interface that a plugin supports. As dlopen is inherently type-unsafe, you have to build your own type system around it. We do that in berlin (and yes, it is totally independent of CORBA) to provide 'kits'. All we use from CORBA in this context is the 'repoId', which is used when the client asks for a specific kit interface (but of course we could fall back to some CORBA-independent mechanism). Besides that, each module has an attribute vector (a set of properties) that can be inspected to see whether the module in question fits the client's needs. This plugin mechanism is encapsulated in two classes. I really don't see a need to build a framework around it.

In a nutshell: if you strip off all those interesting features like location or language transparency, as well as concurrency considerations, I don't see much to be written down at all (especially when dealing with C++, which already has a nice type system, in contrast to C).
If all you need is reflection or some other sort of meta data, I think that could be easily added with a handful of classes, nothing I would call a 'component system'.

Then where is simplicity?, posted 5 Jan 2001 at 00:26 UTC by Malx » (Journeyer)

You are telling me that XPLC is on a different level than pipes. I still call this the MS way of thinking.

You keep referring to the simplicity of pipes, but propose a COM-like system for programmers?! You should be comparing against the simplicity of a.out/ELF/ld.so instead! That's the same level, isn't it? :)

But you are referring to pipes as a component model, not to libraries or plugins... OK, so what are pipes, then?

Pipe IPC is an OS-managed way of transferring bytes among processes, with buffering and control of execution (stopping, restarting and killing/quitting a process depending on the presence of data).
I think that is not the thing you are referring to ;)
What people call "pipes" is really the shell. It:
  • parses the command line
  • makes substitutions according to the filesystem and the current directory
  • creates the pipes (in-process)
  • forks, to build the pipe chain of processes
  • execs
Now, what must be implemented in the programs:
  • Reading stdin / writing stdout (optionally stderr - almost nobody uses it, and almost always under "sh", not "tcsh" etc.)
    This is a very hard thing! Almost no X, SVGAlib or ncurses program does it ;) - same with MS programs
  • Interpreting command-line parameters - prog -opts file1 file2 ...
  • Quitting if something goes wrong (this is not so simple, if you look at it from the COM side ;)
It never bothered with data types... Did you say it has only one data type? Run "ls | mtvtoppm" to see that this is not true ;).
This system does have data types (the simplest being \n, \t and ' ' separation), but their management is up to the user/programmer (who is not forced to check for them - and that is great! - just as in C).

The other thing I should mention:
- The shell way (the Unix way?) - the user can quickly build a program out of blocks, in an interpreted way
( "cat bin/* | grep adobe-")
- The MS way - we do it all for you, you never need to think of anything the programmer did not think of. Oh! You are a programmer?!?? (Ask your admin?! ;)
OK, COM may give you simplicity... maybe... but only to the programmer. And only after recompilation.

Then... why do you not like pipes? :) You brought up the GUI (which you mention as a weak point of the KParts orientation?) as an example. I'll strike back with an example:

deep# ./wmres
"Resolutions" MENU
"1024x768" EXEC ./wmres 1024 768
"800x600" EXEC ./wmres 800 600
"640x480" EXEC ./wmres 640 480
"320x200" EXEC ./wmres 320 200
"Resolutions" END
This is the clear-text output of a program for Window Maker, which generates a submenu. It is GUI (same with the wm* icon-like apps).
OK, so a fast GUI can be pipe-like.

Then - plugins.
Here I could point to the GIMP - it uses executable files as its plugins.

What are the other uses of XPLC?

Is the shell the only way? No - Tcl/Tk with wish, JavaScript as a shell (from Mozilla), Lisp.
Lots of them... (Is Oberon what you need for objects? Or Plan 9?)
Isn't COM just getting rid of the OS's process/application/library management functionality?

I call it the MS way not because MS sucks :) (it is a great system, really), but because you just couldn't think that there is something else out there... :-/

one more..., posted 5 Jan 2001 at 00:31 UTC by Malx » (Journeyer)

Autochooser of VGAlib or Xlib

Unix world - you have gdb, and you have a frontend for X/emacs/KDE/... whatever you like.

Win world - you have kicq, gicq, licq - the GUI is the main part, and the functionality is a side effect of the GUI :))

Re: Then where is simplicity?, posted 5 Jan 2001 at 03:06 UTC by pphaneuf » (Journeyer)

This is hard to explain properly, there are too few examples available...

If pipes were like ELF and ld.so, if you didn't have grep on your system, you wouldn't be able to run cat or sort, or any program that could be input or output of grep.

XPLC makes it possible to use library-packaged software in an optional manner. dlopen() is complicated, but XPLC components you just drop into a directory (the same way you drop an executable binary or script into /usr/local/bin or some other directory in the PATH).
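To show what I mean by "drop into a directory", here is a rough sketch of what the loading side could look like. This is not XPLC's actual loader; the registerModule entry point and its signature are invented for illustration, and a real version would need error reporting and unloading:

    #include <dirent.h>
    #include <dlfcn.h>
    #include <string>

    class IServiceManager;                      // as in the XPLC core
    typedef void (*register_fn)(IServiceManager *);

    // Scan a directory, dlopen() every shared object found, and let each
    // module add its own objects to the service manager.
    void loadComponents(const char *dir, IServiceManager *servmgr)
    {
        DIR *d = opendir(dir);
        if (!d)
            return;

        struct dirent *ent;
        while ((ent = readdir(d)) != 0) {
            std::string name(ent->d_name);
            if (name.size() < 3 || name.compare(name.size() - 3, 3, ".so") != 0)
                continue;

            std::string path = std::string(dir) + "/" + name;
            void *handle = dlopen(path.c_str(), RTLD_NOW);
            if (!handle)
                continue;

            register_fn reg = (register_fn) dlsym(handle, "registerModule");
            if (reg)
                reg(servmgr);
        }
        closedir(d);
    }

The point is that the dlopen() mess lives in one place; the person providing or installing a component never sees it.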

You come from a web background? Maybe you know Zope? XPLC is kinda like Zope, except that it isn't just for Python, and it isn't just for the web. Okay, Zope isn't just for the web either, but let's not split hairs.

Microsoft Internet Explorer is a program that makes me bitter. I hate it, it really bites. But at the same time, the idea behind its design is so nice.

What most people don't realize is that the "location" text field is not just for URLs! If I happen to have a COM component that knows how to handle URLs starting with "foobar:", it will get passed whatever the rest of the string is after the "foobar:" and get asked to retrieve whatever content goes with it. Then, with the content comes the MIME type. It finds a component that can display the obtained MIME type and tells it to do its thing in the browser window. In the case of the HTML renderer, it does that recursively with the <IMG> tags. If they really got it right, you should be able to put an URL pointing to an HTML file as the SRC of an <IMG> tag and have the content of that HTML file display as the content of the image tag!
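Here is a toy sketch of that dispatch idea, just to make it concrete; the registries and interfaces below are invented for illustration, they are not IE's (or anyone else's) actual APIs:

    #include <map>
    #include <string>

    struct Content {
        std::string mimeType;
        std::string data;
    };

    // A component that knows how to fetch content for one URL scheme.
    class IProtocolHandler {
    public:
        virtual ~IProtocolHandler() {}
        virtual Content fetch(const std::string &rest) = 0;   // gets what follows "scheme:"
    };

    // A component that knows how to display one MIME type.
    class IViewer {
    public:
        virtual ~IViewer() {}
        virtual void display(const Content &content) = 0;
    };

    std::map<std::string, IProtocolHandler *> protocolHandlers; // "http", "foobar", ...
    std::map<std::string, IViewer *> viewers;                   // "text/html", "image/png", ...

    void openLocation(const std::string &url)
    {
        std::string::size_type colon = url.find(':');
        if (colon == std::string::npos)
            return;

        // 1. The scheme picks the component that fetches the content.
        IProtocolHandler *handler = protocolHandlers[url.substr(0, colon)];
        if (!handler)
            return;
        Content content = handler->fetch(url.substr(colon + 1));

        // 2. The MIME type picks the component that displays it.
        IViewer *viewer = viewers[content.mimeType];
        if (viewer)
            viewer->display(content);
    }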

I applaud Konqueror: they got the same thing going, thanks to KParts. Very nice job.

And get this: no recompilation and no relinking.

When I say that this is on a different level than Unix shell pipes (I switched from just "Unix pipes" to follow your logic), it is similar to the difference between user-space daemons and kernel-space services. Some things can be implemented in both spaces, like an HTTP server, with different compromises (the user-space HTTP server can be complex and run sub-programs for dynamic content, while the kernel-space one is much faster and can pipeline static content directly from the I/O drivers to the network drivers).

What I don't like about KParts (from what I understood) is the stickiness of the GUI. A KPart is about doing something on the screen it seems. I don't know if you can make one that can do nothing with the screen, say just an URL fetcher (that fetches into a memory buffer, for example).

The problem with a component system coming out of another project is that you end up with only KDE programs using the cool KHTML component.

I must say that I do not totally understand your argumentation though... Particularly the last part, about "Unix world" and "Win world".

The first project I want to do with XPLC, as soon as it is workable enough, is a finger daemon. Yeah, you've got to start small. But I don't really see this as so "Win world". Why don't I just use inetd or xinetd with in.fingerd? The idea is to push the very idea of inetd and xinetd further: one of the big overheads that keeps us from using them for all our services is that they have to fork a process, which is rather slow. What if that was all done internally? Without recompiling? With all the speed of native code!
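Here is a rough sketch of that idea, to show how little is involved once the services are in-process components; the IConnectionHandler interface and the registration map are invented for illustration, and a real implementation would select() over all the listening sockets:

    #include <sys/socket.h>
    #include <unistd.h>
    #include <map>

    // A service (a finger daemon, say) packaged as an in-process component.
    class IConnectionHandler {
    public:
        virtual ~IConnectionHandler() {}
        virtual void handle(int clientSocket) = 0;
    };

    std::map<int, IConnectionHandler *> services;   // listening fd -> handler component

    // Accept one connection and dispatch it: a plain virtual call, no fork().
    void serveOnce(int listenFd)
    {
        int client = accept(listenFd, 0, 0);
        if (client < 0)
            return;

        IConnectionHandler *handler = services[listenFd];
        if (handler)
            handler->handle(client);
        close(client);
    }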

Ok, this is stupid for just a finger daemon, but think about other things: every SSL service could share the private key, so instead of entering a passphrase multiple times, you would enter it only once (and save memory)! An SSH server could be run right from such a componentized inetd without the lag it usually incurs when generating its session key. A componentized tcp_wrappers would be able to keep a cache of DNS queries instead of starting anew at every connection (and would parse the configuration files only once).

These all seem like small wins that don't have much importance, but the thing is that all these small wins accumulate to turn a sleepy old system into a snappy one. Don't you want to save memory, or to support more connections on the same server hardware?

An example where some people did what shalabh said ("do our own tiny component-like system (and lose time)") is in Window Maker. There is now a plugin system for window decorations that can draw them at exactly the same speed as the internal ones - did you know that? Check out the libwmfun package that comes with Window Maker.

What about the "Autochooser of VGAlib or Xlib"? This is about a game, decidedly not command-line, not very "pipable". If it was done with XPLC, someone could add support for GGI, even if they didn't have the source to Quadra, and a user could use that GGI support without recompiling, just by dropping a hypothetical "ggi.so" into a directory and starting the game!

What does gdb and its various front-ends have to do with that?

Maybe you have an extremely fast computer and don't see the difference, but when I use the OSS output plugin for XMMS compared to the ESD output plugin (which goes through a pipe to the ESD daemon), I get major performance lossage: significant latency, it skips easily, etc.

I don't think I see your point very clearly, I am sorry... Anyway, as sab39 and nymia said, I'll hack on whatever I want, m'kay? I'm just here to ask you guys about ideas, and you might just convince me to do something other than XPLC, but I still think it's more a matter of getting it right.

Anti-DCE, posted 8 Jan 2001 at 15:31 UTC by lkcl » (Master)

there is a significant difference between the DCE/RPC runtime library (250,000 lines of code, 50,000 of which is the IDL compiler, a further 20,000 is marshalling / unmarshalling of basic and complex types) and the DCE environment _based_ on that rtenv ("8 million lines. 8 _million_ lines. 8 milllion _lines_" - to pseudo-quote a line by danny devito, no prizes for guessing the name of the film as i've forgotten).

microsoft did a _vastly_ significantly better job of implementing [or improving on] the original DCE 1.1 rtenv, _especially_ when i started on the samba-compatible version: i was finding bugs _literally_ at the rate of one every two weeks for _two years_, until they finally gave up and did a total rewrite for Windows 2000, and now even _i_ can't find ways to crash w2k services (except spoolss.exe, which is still a piece of shit).

yes, the documentation for DCE 1.1 totally sucks, and the code's pretty hairy too. however, if you stick at it for about two years, ignoring the docs and the code and just getting on with it, you get to quite _like_ the way DCE/RPC works.

don't worry: i'm just twisted and perverted by my close exposure to Things That Microsoft Loves Most: SMB and DCE/RPC.

Complexity, posted 8 Jan 2001 at 15:48 UTC by lkcl » (Master)

Be warned by the size of the code-base. If it's 100,000 lines of code or more, don't even THINK about saying xxxxing stupid things like,

"but it's _far_ too complicated. surely it's got to be simpler than that?" and on this basis, reject the entire code-base.

THINK. UNDERSTAND. be prepared to stare at code, write code, stare at network traces [if it's over-wire] for AT LEAST a year. THEN consider sticking your oar in.

the samba dce/rpc codebase i worked on was rejected by Andrew Tridgell because, even though he is a highly respected, specialist Unix Systems and Algorithms Programmer with a PhD, he could not grasp the necessity of the levels upon levels upon levels [and i'm not kidding: see http://lists.samba.org/pipermail/samba-technical/2000-February/006380.html which is a three-part series of messages] that were required, and rejected it out-of-hand.

i see that there is evidence of this occurring elsewhere in the open source community, which is why i specifically wanted to bring up this particularly irksome topic to, hopefully, reinforce the lessons to be learned, with another appropriate example.

to achieve certain very large goals for which a decade or more of man-years is required to implement, you WILL need to use a series of small, simple solutions which, when layered together with WELL-DEFINED interfaces, will give you extremely powerful capabilities. if you think you can do the same thing WITHOUT spending the time, you are deluding yourself very badly.

DCE and complexity, posted 9 Jan 2001 at 16:02 UTC by pphaneuf » (Journeyer)

LOL, I love the pseudo-quote! :-)

I understand your point perfectly, and while I always think it might be possible to shave a few thousand lines (off of an 8 million line project!), there is a reason for all this complexity and code size.

But I'm taking this in a wholly different view. I'm not even thinking of being compatible with MS COM, as XPCOM once did (I don't know if they still entertain that notion). I'm not (explicitly) supporting remoting. I'm not supporting exceptions. I'm not supporting this and that.

The point is a different engineering compromise than the one they did at Microsoft. I am betting that a less featureful system with a lower barrier to entry will have a better yield.

I might be wrong about this, but I'm having an awful lot of fun anyway, so the hell with it. :-)

I have been poring over papers and code for a bit more than two years now, and I understand it will take even longer before we get a really good offering (though we'll have something usable and reasonably useful pretty soon).

You are talking about small, simple solutions that can be layered to build larger things, and about well-defined interfaces. That's precisely everything that XPLC is about.

.net, posted 11 Jan 2001 at 14:28 UTC by lkcl » (Master)

You are talking about small, simple solutions that can be layered to build larger things, and about well-defined interfaces. That's precisely everything that XPLC is about.
excellent. i wish you every success. given that there have been about five separate articles all about this funny compartmentalisation / library issue, now that OpSrc is getting so large and clunky, i intuitively feel that something out there, real soon, _is_ going to fit everything together - properly - and move unix up the evolutionary tree a few branches. maybe it's microsoft's .net strategy: that'd be a hoot.

.net??, posted 20 Jan 2001 at 22:37 UTC by aaronv » (Apprentice)

the motivation for .net is sincere, and the problems it solves really *do need to be solved*. Instant binary interoperability without an IDL and automatic data marshalling are great, but .NET would be horrible for two reasons:

  • The middleware goes inside compiler-emitted machine code. This is bad, especially for debugging purposes: what happens when you're trying to profile or debug, and compiler-emitted symbols make it impossible to get clean data?
  • Support modules come from a central server, and that server sits at Microsoft. Do we really want to give Microsoft control of something so central to our lives as programmers?

I can't remember if I've voiced these same concerns before. This is a bit OT, but this whole middleware topic strikes a very loud chord: is explicit middleware good enough, or do we really want to move forward and make it automatic?

Internet Explorer, posted 23 Jan 2001 at 19:43 UTC by pphaneuf » (Journeyer)

Today I saw a Windows machine, so I thought I would check the size of the Internet Explorer executable: 60 kilobytes. I sure don't like Microsoft software most of the time, but I have to applaud them for such a modular architecture!

The DCE critics link has changed., posted 13 Jun 2001 at 20:04 UTC by pphaneuf » (Journeyer)

It is now over here.

XPCOM status, posted 2 Aug 2001 at 17:30 UTC by pphaneuf » (Journeyer)

http://mozilla.org/projects/xpcom/ was last modified on the 15th of May 2000 (look at the bottom of the page; the "last modified" stamp of 16 November 1999 is not accurate, and I'm having it removed).

That was to add a link to the Standalone XPCOM page, which was last updated on the 26th of May 2000.
