Advogato: Why COM Stinks, Since You Asked

This is not an essay or a reasoned discourse so much as a list of aggravations and headaches I've had over the years with COM. I've come to believe that COM not only stinks, but has wasted more programmer time to less good purpose than any other API or technology I can think of. My dislike for this API knows no bounds.

1. COM was not a designed technology. That is, it was never thought out completely beforehand. Microsoft never intended COM to be as prevalent and deeply embedded in Windows as it is today; it was intended to replace the horrible and unreliable DDE. It was needed to allow Microsoft's new Office suite to interoperate more gracefully. Microsoft's developers, being the good dogmatists they are, adopted the then-cutting-edge concept of component software. That is, rather than having many monolithic pieces of software all doing different things, you'd simply write a bunch of objects which would expose interfaces you could plug into your own applications.

But COM was really only thought out as a way to link a few applications together: a spreadsheet (Excel), a word processor (Word), and e-mail (MAPI). It didn't then (and still doesn't) operate very gracefully with database applications, even Microsoft's own Access product.

Over time, Microsoft moved more and more low-level system work into COM objects. It was modern! It was high-tech! It made software more modular! But it also made software dependencies ferociously complex, and horrendously difficult to debug. You can't really flowchart a modern application's information flow if it is a COM application -- you can't tell what the I/O is doing at a given moment because you can't get a good picture of how many programs are holding references to the COM object.

And then there's the worst and most glaring defect of COM: a bug in a low-level component means that all other programs that depend on that component will be buggy too. And if the component is a vendor-written one, it cannot be fixed except by the vendor.

Finally, because Microsoft kept the DLL (dynamically linked library) format for libraries and just shoehorned COM objects into them, we still have to deal with "DLL hell" -- one system DLL can overwrite another. In COM-land this problem is even worse, since the COM interfaces of two DLLs may look exactly the same, but may have different implementations depending on the DLL installed.
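
To make that last point concrete, here is a toy sketch in Python. It is an analogy, not real COM: the `installed` table, the `install` function, the `ITrim` interface ID, and both trim functions are made up for illustration. Two libraries register under the same interface ID with the same call signature, and the client has no way to tell from the interface which one it got.

```python
from typing import Callable, Dict

# Hypothetical stand-in for a system-wide table: interface ID -> implementation.
# Whichever library registered last wins, as when one DLL overwrites another.
installed: Dict[str, Callable[[str], str]] = {}


def install(interface_id: str, implementation: Callable[[str], str]) -> None:
    """Register an implementation, silently replacing any earlier one."""
    installed[interface_id] = implementation


def trim_v1(text: str) -> str:
    """Vendor A's version: strips whitespace from both ends."""
    return text.strip()


def trim_v2(text: str) -> str:
    """Vendor B's version: same signature, but strips only the right end."""
    return text.rstrip()


def client_code(text: str) -> str:
    """The client sees only the interface ID, never the implementation."""
    return installed["ITrim"](text)


if __name__ == "__main__":
    install("ITrim", trim_v1)
    print(repr(client_code("  hello  ")))
    install("ITrim", trim_v2)  # a later installer overwrites the entry
    print(repr(client_code("  hello  ")))
```

The client code never changes between the two calls, yet its result does. That is the whole problem in miniature.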

2. The Registry. A modern Windows system has to support literally thousands of discrete COM interfaces in order to function. How are all these interfaces to be tracked and managed, especially in networked environments? Some genius at Microsoft came up with the idea of a central, hierarchical registry of settings.

Now, this in itself isn't a bad idea. Where Microsoft went wrong was in making it the central repository for the entire operating system. Furthermore, they designed it as a binary database rather than plain-text. In practice, this means two things: if the Registry gets corrupted (a fairly common occurrence for a developer), it can keep your machine from even booting up, and you can't just use a text editor to fix it. If you hose the Registry badly enough, a re-install is your only recourse. This is an unconscionable design decision, and in my view the very worst thing about COM.

Contrast this with most Unix systems, where system configuration files are kept separate from application configuration files, and nearly all configuration files are plain-text. Further, a faulty configuration file here or there will not prevent the system from booting up (it may fail to work correctly, but it will usually boot). (The only exception to this that I can think of is a botched lilo.conf or grub.conf file.)
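
As a rough illustration of why per-application plain-text files degrade more gracefully, here is a minimal Python sketch using the standard `configparser` module. The `[service]` section, the setting names, and the `DEFAULTS` table are assumptions invented for the example. A malformed file costs one program its custom settings; it does not take anything else down with it.

```python
import configparser

# Hypothetical built-in defaults for one application.
DEFAULTS = {"port": "8080", "log_level": "info"}


def load_settings(text: str) -> dict:
    """Parse INI-style text; fall back to DEFAULTS if the text is malformed."""
    parser = configparser.ConfigParser()
    try:
        parser.read_string(text)
    except configparser.Error:
        # A broken file only affects this program, and only its overrides.
        return dict(DEFAULTS)
    settings = dict(DEFAULTS)
    if parser.has_section("service"):
        settings.update(parser["service"])
    return settings


if __name__ == "__main__":
    print(load_settings("[service]\nport = 9090\n"))  # valid file
    print(load_settings("port = 9090\n"))  # no section header: malformed
```

And since the file is plain text, fixing the malformed version takes nothing fancier than a text editor.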

3. Security. Oh, we don't need to bring up the latest Blaster worm to belabor this point. All we have to do is look at the dependencies for any substantial COM program, and try to figure out where any possible buffer-overrun or elevated-privilege situation might come up. It's hopeless. COM is pervasive in Windows, and this means that Windows is not securable even in principle. There's just no way to figure out all possible program pathways or all possible input/output combinations. In the name of making all applications "interoperable", Microsoft has chosen to make everything equally insecure.

A worse problem is that, in order for many users to be able to install ActiveX plugins (a fancy name for -- you guessed it -- COM components), they have to have elevated system privileges (usually Administrator). So if you open up an infected e-mail in Outlook, it (being a good COM citizen) will automatically invoke a helper program to decode the content of said message, and all of a sudden...you're boned. Sure, you can argue that users should be smarter than to open binary attachments from strangers -- but should Microsoft really make it that easy for someone to shoot themselves in the foot? Or make it that easy for a virus/worm/trojan to propagate so quickly among networked machines?

The only way to make a Windows machine secure is to turn the power off.

4. COM is a programmer's nightmare. Anyone who has had to wrestle with the rat's nest of ATL (Active Template Library, the C++ template library for writing COM code) knows what I'm talking about. It is impossible to write clean COM code. In fact, reading COM code isn't like reading C++ code at all -- it's a maze of casts, macros, compiler-specific keywords, and IDE-generated templates that give no clear idea as to the code's underlying structure or purpose.

Another peeve I have is that ATL/COM pollutes the global namespace -- many things get #define'd and typedef'd in the myriad header files, and few of them are adequately documented. Worse, if you happen to use a #define of your own that conflicts with one that COM or ATL defines, things can break in obscure and very hard-to-debug ways. (If you are lucky, the compile will fail; if you are unlucky, the program will compile but then fail at run-time with infuriatingly weird behavior that can take ages to figure out.)

In my darker moments, I wonder if the engineers at Microsoft secretly hate their jobs, and want to make sure that we hate ours, too.

CONCLUSION

Oh, I could go on and on about how and why COM is a horrible thing. The best thing you can say about it is that it (kind of) works. But even this is faint praise -- there are other approaches that work just as well with far less pain. They could have adopted the Unix method of piping one program's output into another's input, or they could have adopted a systemwide scripting language like Apple's AppleScript. (They did eventually do this -- with Windows Script Host -- but it came too late and was too limited to do much good.)
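
For anyone who hasn't seen the piping approach up close, here is a small Python sketch of it using the standard `subprocess` module. The `pipe` helper and the two one-line programs in `PRODUCER` and `CONSUMER` are made up for the example. The point is that the two programs share nothing except a stream of text: no interfaces to register, no reference counts to track.

```python
import subprocess
import sys

# Two hypothetical stand-alone programs, written as Python one-liners so the
# sketch runs anywhere Python does.
PRODUCER = [sys.executable, "-c", "print('alpha'); print('beta'); print('alpha')"]
CONSUMER = [
    sys.executable,
    "-c",
    "import sys; print(sum('alpha' in line for line in sys.stdin))",
]


def pipe(producer_cmd: list, consumer_cmd: list) -> str:
    """Run two programs with the first one's stdout feeding the second's stdin."""
    producer = subprocess.Popen(producer_cmd, stdout=subprocess.PIPE)
    consumer = subprocess.Popen(
        consumer_cmd, stdin=producer.stdout, stdout=subprocess.PIPE, text=True
    )
    # Close our copy of the pipe so the consumer sees end-of-file
    # as soon as the producer exits.
    producer.stdout.close()
    output, _ = consumer.communicate()
    producer.wait()
    return output


if __name__ == "__main__":
    # Counts how many lines from the producer contain "alpha".
    print(pipe(PRODUCER, CONSUMER).strip())
```

Either program can be swapped out for anything else that reads or writes lines of text, and the other one never needs to know.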

The worst part of this whole sorry saga is that we're stuck with COM. Windows these days is mostly made up of the kernel plus a big grab-bag of COM-based APIs. Even the venerable ODBC -- the most decent API Microsoft ever came up with -- has been deprecated in favor of OLE DB (which is slower and less secure, but hey, it's based on COM). Even the vaunted .NET stuff relies on COM under the covers. It makes the programmer's job (somewhat) less vexing, but it does nothing for the security side of things. And there's always the Registry to put the gray in your hair.

So what's a programmer to do? My advice: just do your best, and use COM sparingly, and only if you absolutely need to have it. Even in these debased times, it's possible to write COM-free software on Windows.

Posted 21 Sep 2003 at 19:11 UTC by mrorganic

Well said., posted 21 Sep 2003 at 21:02 UTC by DeepNorth » (Journeyer)

Not the reasons I would pick....., posted 22 Sep 2003 at 12:22 UTC by listen » (Journeyer)

.Net won't save them from making the same mistakes, posted 24 Sep 2003 at 10:09 UTC by murrayc » (Master)

Protocols, not components, posted 24 Sep 2003 at 12:49 UTC by Omnifarious » (Journeyer)

I always wondered about that, posted 24 Sep 2003 at 17:05 UTC by sej » (Master)

Building And Linking A Library Shouldn't Be Hard As COM, posted 24 Sep 2003 at 17:42 UTC by nymia » (Master)

First, a point by point analysis of this article, posted 26 Sep 2003 at 21:17 UTC by pphaneuf » (Journeyer)

Next, the comments..., posted 26 Sep 2003 at 23:08 UTC by pphaneuf » (Journeyer)

Finally, the diary entries..., posted 26 Sep 2003 at 23:40 UTC by pphaneuf » (Journeyer)

A few clarifications, posted 27 Sep 2003 at 13:11 UTC by mrorganic » (Journeyer)

A few more clarifications, posted 27 Sep 2003 at 21:06 UTC by pphaneuf » (Journeyer)

Clarify this!, posted 29 Sep 2003 at 13:25 UTC by apenwarr » (Master)

How can we build a better world?, posted 29 Sep 2003 at 14:20 UTC by Malx » (Journeyer)

Huh, posted 30 Sep 2003 at 11:29 UTC by listen » (Journeyer)

Never breaking interfaces, posted 30 Sep 2003 at 18:28 UTC by pphaneuf » (Journeyer)

Interface-driven programming..., posted 1 Oct 2003 at 19:04 UTC by mrorganic » (Journeyer)

?, posted 1 Oct 2003 at 21:11 UTC by Malx » (Journeyer)

what is XPLC trying to solve, posted 1 Oct 2003 at 22:32 UTC by pphaneuf » (Journeyer)

XPLC doesn't prevent you from using your favourite C++ features, posted 13 Oct 2003 at 17:24 UTC by apenwarr » (Master)