Older blog entries for apenwarr (starting at number 633)

9 Jul 2014 »

The Curse of Vicarious Popularity

I had already intended for this next post to be a discussion of why people seem to suddenly disappear after they go to work for certain large companies. But then, last week, I went and made an example of myself.

Everything started out normally (the usual bit of attention on news.yc). It then progressed to a mention on Daring Fireball, which was nice, but okay, that's happened before. A few days later, though, things started going a little overboard, as my little article about human nature got a company name attached to it and ended up quoted on Business Insider and CNet.

Now don't get me wrong, I like fame and fortune as much as the next person, but those last articles crossed an awkward threshold for me. I wasn't quoted because I said something smart; I was quoted because what I said wasn't totally boring, and an interesting company name got attached. Suddenly it was news, where before it was not.

Not long after joining a big company, I asked my new manager - wow, I had a manager for once! - what would happen if I simply went and did something risky without getting a million signoffs from people first. He said something like this, which you should not quote because he was not speaking for his employer and neither am I: "Well, if it goes really bad, you'll probably get fired. If it's successful, you'll probably get a spot bonus."

Maybe that was true, and maybe he was just telling me what I, as a person arriving from the startup world, wanted to hear. I think it was the former. So far I have received some spot bonuses and no firings, but the thing about continuing to take risks is my luck could change at any time.

In today's case, the risk in question is... saying things on the Internet.

What I have observed is that the relationship between big companies and the press is rather adversarial. I used to really enjoy reading fake anecdotes about it at Fake Steve Jobs, so that's my reference point, but I'm pretty sure all those anecdotes had some basis in reality. After all, Fake Steve Jobs was a real journalist pretending to be a real tech CEO, so it was his job to know both sides.

There are endless tricks being played on everyone! PR people want a particular story to come out so they spin their press releases a particular way; reporters want more conflict so they seek it out or create it or misquote on purpose; PR people learn that this happens so they become even more absolutely iron-fisted about what they say to the press. There are classes that business people at big companies can take to learn to talk more like politicians. Ironically, if each side would relax a bit and stop trying so hard to manipulate the other, we could have much better and more interesting and less tabloid-like tech news, but that's just not how game theory works. The first person to break ranks would get too much of an unfair advantage. And that's why we can't have nice things.

Working at a startup, all publicity is good publicity, and you're the underdog anyway, and you're not publicly traded, so you can be pretty relaxed about talking to the press. Working at a big company, you are automatically the bad guy in every David and Goliath story, unless you are very lucky and there's an even bigger Goliath. There is no maliciousness in that; it's just how the story is supposed to be told, and the writers give readers what they want.

Which brings me back to me, and people like me, who just write for fun. Since I work at a big company, there are a bunch of things I simply should not say, not because they're secret or there's some rule against saying them - there isn't, as far as I know - but because no matter what I say, my words are likely to be twisted and used against me, and against others. If I can write an article about Impostor Syndrome and have it quoted by big news organizations (to their credit, the people quoting it so far have done a good job), imagine the damage I might do if I told you something mean about a competitor, or a bug, or a missing feature, or an executive. Even if, or especially if, it were just my own opinion.

In the face of that risk - the risk of unintentionally doing a lot of damage to your friends and co-workers - most people just give up and stop writing externally. You may have noticed that I've greatly cut back myself. But I have a few things piling up that I've been planning to say, particularly about wifi. Hopefully it will be so technically complicated that I will scare away all those press people.

And if we're lucky, I'll get the spot bonus and not that other thing.

Syndicated 2014-07-08 23:05:01 from apenwarr

1 Jul 2014 »

The Curse of Smart People

A bit over 3 years ago, after working for many years at a series of startups (most of which I co-founded), I disappeared through a Certain Vortex to start working at a Certain Large Company which I won't mention here. But it's not really a secret; you can probably figure it out if you bing around a bit for my name.

Anyway, this big company that now employs me is rumoured to hire the smartest people in the world.

Question number one: how true is that?

Answer: I think it's really true. A surprisingly large fraction of the smartest programmers in the world *do* work here. In very large quantities. In fact, quantities so large that I wouldn't have thought that so many really smart people existed or could be centralized in one place, but trust me, they do and they can. That's pretty amazing.

Question number two: but I'm sure they hired some non-smart people too, right?

Answer: surprisingly infrequently. When I went for my job interview there, they set me up for a full day of interviewers (5 sessions plus lunch). I decided that I would ask a few questions of my own in these interviews, and try to guess how good the company is based on how many of the interviewers seemed clueless. My hypothesis was that there are always some bad apples in any medium to large company, so if the success rate was, say, 3 or 4 out of 5 interviewers being non-clueless, that's pretty good.

Well, they surprised me. I had 5 out of 5 non-clueless interviewers, and in fact, all of them were even better than non-clueless: they impressed me. If this was the average smartness of people around here, maybe the rumours were really true, and they really had something special going on.

(I later learned that my evil plan and/or information about my personality *may* have been leaked to the recruiters who *may* have intentionally set me up with especially clueful interviewers to avoid the problem, but this can neither be confirmed nor denied.)

Anyway, I continue to be amazed at the overall smartness of people at this place. Overall, very nearly everybody, across the board, surprises or impresses me with how smart they are.

Pretty great, right?

Yes.

But it's not perfect. Smart people have a problem, especially (although not only) when you put them in large groups. That problem is an ability to convincingly rationalize nearly anything.

Everybody rationalizes. We all want the world to be a particular way, and we all make mistakes, and we all want to be successful, and we all want to feel good about ourselves.

We all make decisions for emotional or intuitive reasons instead of rational ones. Some of us admit that. Some of us think using our emotions is better than being rational all the time. Some of us don't.

Smart people, computer types anyway, tend to come down on the side of people who don't like emotions. Programmers, who do logic for a living.

Here's the problem. Logic is a pretty powerful tool, but it only works if you give it good input. As the famous computer science maxim says, "garbage in, garbage out." If you know all the constraints and weights - with perfect precision - then you can use logic to find the perfect answer. But when you don't, which is always, there's a pretty good chance your logic will lead you very, very far astray.

Most people find this out pretty early on in life, because their logic is imperfect and fails them often. But really, really smart computer geek types may not ever find it out. They start off living in a bubble, they isolate themselves because socializing is unpleasant, and, if they get a good job straight out of school, they may never need to leave that bubble. To such people, it may appear that logic actually works, and that they are themselves logical creatures.

I guess I was lucky. I accidentally co-founded what turned into a pretty successful startup while still in school. Since I was co-running a company, I found out pretty fast that I was wrong about things and that the world didn't work as expected most of the time. This was a pretty unpleasant discovery, but I'm very glad I found it out earlier in life instead of later, because I might have wasted even more time otherwise.

Working at a large, successful company lets you keep your isolation. If you choose, you can just ignore all the inconvenient facts about the world. You can make decisions based on whatever input you choose. The success or failure of your project in the market is not really that important; what's important is whether it gets canceled or not, a decision which is at the whim of your boss's boss's boss's boss, who, as your only link to the unpleasantly unpredictable outside world, seems to choose projects quasi-randomly, and certainly without regard to the quality of your contribution.

It's a setup that makes it very easy to describe all your successes (project not canceled) in terms of your team's greatness, and all your failures (project canceled) in terms of other people's capriciousness. End users and profitability, for example, rarely enter into it. This project isn't supposed to be profitable; we benefit whenever people spend more time online. This project doesn't need to be profitable; we can use it to get more user data. Users are unhappy, but that's just because they're change averse. And so on.

What I have learned, working here, is that smart, successful people are cursed. The curse is confidence. It's confidence that comes from a lifetime of success after real success, an objectively great job, working at an objectively great company, making a measurably great salary, building products that get millions of users. You must be smart. In fact, you are smart. You can prove it.

Ironically, one of the biggest social problems currently reported at work is lack of confidence, also known as Impostor Syndrome. People with confidence try to help people fix their Impostor Syndrome, under the theory that they are in fact as smart as people say they are, and they just need to accept it.

But I think Impostor Syndrome is valuable. The people with Impostor Syndrome are the people who *aren't* sure that a logical proof of their smartness is sufficient. They're looking around them and finding something wrong, an intuitive sense that around here, logic does not always agree with reality, and the obviously right solution does not lead to obviously happy customers, and it's unsettling because maybe smartness isn't enough, and maybe if we don't feel like we know what we're doing, it's because we don't.

Impostor Syndrome is that voice inside you saying that not everything is as it seems, and it could all be lost in a moment. The people with the problem are the people who can't hear that voice.

Syndicated 2014-06-30 07:05:56 from apenwarr

30 Jun 2013 »

Reflections on Marketing and Humanity

Today I have new evidence that the human brain is made up of multiple interoperating, loosely connected components. Because I was out buying dryer sheets and there's one with "Fresh Linen" scent. And while one part of my mind was saying, "That's rather tautological," another part was saying, "That's what I always wanted my linen to smell like!" So I bought it, and now you know which one wins.

In the same aisle I found a new variant of soap with the tagline "inspired by celtic rock salt." Now, inspiration can be a hard thing to pin down, but this soap contains nothing celtic and no salt. I'm not even sure there is such a thing as celtic rock salt, or if there is, that it differs in any way from other rock salt, or other salt for that matter. Moreover, the whole purpose of soap is to wash off the generally salty, sweaty, smelly mess you produced naturally, so we'd probably criticize them if it *were* salty, for the same reason people criticize shampoos for stripping your hair of its natural oils only to sell the oils back to you in the form of conditioner. Also, how long has soap had an "Ingredients" section on the package? Why not a nutritional content section? And is it bad when the first ingredient is (literally) "soap"? But I bought it anyway, because Irish. Salt. Mmmm.

Finally, a note to you people who would argue that I'm overanalyzing this. You might define overanalyzing as analyzing beyond the point required to make a decision. Since the analysis figured not one bit into my purchasing decision, by that definition, any analysis at all would be considered overanalysis. And frankly, that just doesn't seem fair.

Syndicated 2013-06-30 19:38:22 from apenwarr - Business is Programming

28 Jun 2013 (updated 14 Jul 2014 at 07:02 UTC) »

2013-06-28

A quote from "The Trouble With Computers" about usability studies at Apple when they were developing the original Macintosh:

Apple interface guru Bruce Tognazzini tells this story. The in-box tutorial for novices, "Apple Presents... Apple," needed to know whether the machine it was on had a color monitor. He and his colleagues rejected the original design solution, "Are you using a color TV on the Apple?" because computer store customers might not know that they were using a monitor with the color turned off. So he tried putting up a color graphic and asking, "Is the picture above in color?" Twenty-five percent of test users didn't know; they thought maybe their color was turned off.
Then he tried a graphic with color names in their color, GREEN, BLUE, ORANGE, MAGENTA, and asked, "Are the words above in color?" Users with black and white or color monitors got it right. But luckily the designers tried a green-screen monitor too. No user got it right; they all thought green was a fine color.
Next he tried the same graphic but asked, "Are the words above in more than one color?" Half the green-screen users flunked, by missing the little word "in". Finally, "Do the words above appear in several different colors?"
Success.

That was what Apple did for a single throwaway UX question in a non-core part of the product - before its first release. It apparently took about 5 iterations of UX design followed up by UX research before they finally converged on the right answer.

The lesson I learned from that: usability studies are important. But you can't just take the recommendations of a usability study; you have to implement the recommendations, do another study, be prepared to be frustrated that the new version is just as bad as the old one, and do it all again. And again. If that's not how you're doing usability studies, you're doing it wrong.

Maybe I should re-buy that book. I gave mine away at some point. It's kind of indispensable as a tool for explaining software usability research, if only for its infamous "Can you *not* see the cow?" photo.

Syndicated 2013-06-28 03:47:19 (Updated 2014-07-14 07:02:06) from apenwarr - Business is Programming

12 Jun 2013 »

Cheap Thrills

I've heard it said that you can just alternate between two UI themes once a week, and every time you switch, the new one will feel prettier, newer, and more exciting than the old one.

This is a natural tendency. The human mind is intrigued by change. That's where fashion comes from, and fads. It gives you a little burst of some chemical, maybe adrenaline (fear of the unknown?), or endorphins (appreciation of the unexpected?), or perhaps some other kind of juice I heard of somewhere but I don't really know what it does.

In tech, this kind of unlimited attraction to the unexpected is the main characteristic of the first phase of the Technology Adoption Lifecycle, the so-called "Innovators."

Source: Wikimedia Commons

Perhaps people are happy to be included in the Innovator category. But Innovation isn't just doing something different for the sake of being different. Real innovation is the *willingness* to take the *risk* to do something different, because you know that difference is expensive, but that it will pay off in some way that more conservative sorts will fail to recognize until later.

In fashion, the end goal is to catch people's attention; if you do that, you are innovative. That's why fashion repeats itself every few years: because you can be innovative over and over again with the same ideas, rehashed forever.

In technology, we can hold you to a higher standard. Innovation requires difference, but it also requires a vision of usefulness. Change is expensive. Staying the same is cheap. Make it worth my while. Or if I'm an Innovator, or even an Early Adopter, at least give me a hint about how it's worth my while so I can exploit it while others are too afraid.

Every needless change creates expensive fragmentation. Microsoft ruled their market by being change averse. So did IBM. So did Intel. Even Apple. Whenever they forgot this, they stumbled.

Change aversion works because what makes a platform successful isn't so much the platform as the complementary products. For a phone, that means third-party power adapters, car chargers, headphones with integrated volume controls, alarm clocks with a connector to charge your phone *and* play your music at the same time. For a PC, it could be something as simple as maintaining the same power supply connector across many years' worth of models, so that anyone who standardizes on your brand will have an ever-growing investment in leftover power supplies plugged in wherever they might want them. For an operating system, it means keeping the same approximate style of UI for a long time, so that apps can learn to optimize for it, and a really great app made two years ago can keep on selling well, perhaps with bugfixes and new features but no need for rewrites, because it still looks like it's perfectly integrated into your OS experience. That sort of consistency allows developers to focus on quality instead of flavour, and produces an overall feeling of well-integratedness. It makes people feel like when they buy your thing, they're paying for quality. And yes, people - moving beyond the innovators into the more profitable market segments of the curve - will definitely pay for quality.

Real design genius lies in the ability to make something look pretty, and with gentle updates to keep it modern looking, without causing huge disruption to your whole ecosystem every couple of years. Following fashion trends, while not caring about disruption, does not require genius at all. All it requires is a factory in a third-world country and some photos of what you want to copy.

Ironically, even app developers mostly fail to recognize just how bad it is for them when a platform changes out from under them unnecessarily. Instead, they get excited by it. Finally, I get to rewrite that UI code I really hated, and while I'm there, I can fix all those interaction bugs I knew we had but could never justify repairing! Because now I *have* to rewrite it!

Redesigning things to match a moving target of a platform is really comforting, because it's a ready-made strategy for your company. The truth is, you don't have to think about what customers want, or how to make the workflow smoother, or how to eliminate one more click from that common operation, or how to fix that really annoying network bug that only happens 1 in 1000 times. Those bugs are hard; this feels like freedom. We'll just dedicate our team to "refreshing" the UI, again, for another few months, and nobody can complain because it's obviously necessary. And it is, obviously, necessary. Because your platform has screwed you. Your platform changed for no reason, and that's why your users can't have what they really need. They'll get a UI refresh instead.

And although they are less productive, they will love it. Because of endorphins, or sodium, or whatever.

And so you will feel good about yourself in the morning.

Syndicated 2013-06-12 07:35:30 from apenwarr - Business is Programming

9 Jun 2013 (updated 30 Jun 2014 at 06:02 UTC) »

A Modest Proposal for the Telephone Network

You might not realize it, but there's an imminent phone number shortage. It's been building up for a while, but the problem has been mitigated by people using "PBXes", which basically add a 4-5 digit extension to the end of your phone number to expand the available range. The problem with PBXes is they don't work right with caller ID (it makes it look like a bunch of people near each other all have the same phone number) and you can't easily direct-dial PBX extensions from a phone's integrated address book, unless your phone has some kind of special "PBX penetration" technology. (PBX penetration is pretty well-understood, but not implemented widely.)

Even worse: it's no longer possible to route phone calls hierarchically by the first few digits. Nowadays any 10-digit U.S. phone number could be registered anywhere in the U.S. and area codes change all the time.

So here's my proposal. Let's fix this once and for all! We'll double the number of digits in a Canada/U.S. phone number from 10 to 20. No, wait, that might not be enough to do fully hierarchy-based call routing, let's make it 40 digits. But that could be too much typing, so instead of using decimal, we can add a few digits to your phone dialpad and let you use hexadecimal instead. Then it should only be 33 digits or so, with the same numbering capacity as 40 decimal digits! Awesome!
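For anyone who wants to check the digit arithmetic, here is a minimal sketch. The helper name `digits_needed` is made up for illustration. Each decimal digit is worth log(10)/log(16), or about 0.83, hex digits, so 40 decimal digits come to roughly 33.2 hex digits; strictly covering every 40-digit number takes 34.

```python
import math


def digits_needed(capacity: int, base: int) -> int:
    """Return the fewest base-`base` digits that give at least `capacity` distinct numbers."""
    digits = 0
    reachable = 1  # how many distinct numbers `digits` digits can express
    while reachable < capacity:
        reachable *= base
        digits += 1
    return digits


DECIMAL_DIGITS = 40
capacity = 10 ** DECIMAL_DIGITS  # distinct 40-digit decimal phone numbers

# Continuous estimate: each decimal digit is worth log(10)/log(16) hex digits.
estimate = DECIMAL_DIGITS * math.log(10) / math.log(16)

# Exact count, using integer arithmetic only.
exact = digits_needed(capacity, 16)

print(f"estimate: {estimate:.1f} hex digits")
print(f"exact:    {exact} hex digits")
```

So "33 digits or so" is about right, give or take one digit of rounding.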

It'll still be kind of a pain to remember numbers that long, but don't worry about it, nobody actually dials directly by number anymore. We have phone directories for that. And modern smartphones can just autodial from hyperlinks on the web or in email. Or you can send vcards around with NFC or infrared or QR codes or something. Okay, those technologies aren't really perfect and there are a few remaining situations where people actually rely on the ability to remember and dial phone numbers by hand, but it really shouldn't be a problem most of the time and I'm sure phone directory technology will mature, because after all, it has to for my scheme to work.

Now, as to deployment. For a while, we're going to need to run two parallel phone networks, because old phones won't be able to support the new numbering scheme, and vice versa. There's an awful lot of phone software out there hardcoded to assume its local phone number will be a small number of digits that are decimal and not hex. Plus caller ID displays have a limited number of physical digits they can show. So at first, every new phone will be assigned both a short old-style phone number and a longer new-style phone number. Eventually all the old phones will be shut down and we can switch entirely to the new system. Until then, we'll have to maintain the old-style phone number compatibility on all devices because obviously a phone network doesn't make any sense if everybody can't dial everybody else.

Actually you only need to keep an old-style number if you want to receive *incoming* calls. As you know, not everybody really needs this, so it shouldn't be a big barrier to adoption. (Of course, now that I think of it, if that's true, maybe we can conserve numbers in the existing system by just not assigning a distinct number to phones that don't care to receive calls. And maybe charge extra if you want to be assigned a number. As a bonus, people without a routable phone number won't ever have to receive annoying unsolicited sales calls!)

For outgoing calls, we can have a "carrier-grade PBX" sort of system that basically maps from one numbering scheme to the other. Basically we'll reserve a special prefix in the new-style number space that you'd dial when you want to connect to an old-style phone. And then your new phone won't need to support the old system, even if not everyone has transitioned yet! I mean, unless you want to receive incoming calls.

...

Or, you know. We could just automate connecting through a PBX.

Syndicated 2013-06-09 16:08:21 (Updated 2014-06-30 06:02:03) from apenwarr - Business is Programming

26 Apr 2013 (updated 26 Apr 2013 at 08:02 UTC) »

blip: a tool for seeing your Internet latency

I just released a pretty neat tool that I wrote, called blip. You can read the README to find out what's going on. Or, if your RSS reader or web browser supports iframes, you can see it in action right here:

</p><p> That's a graph of *your* Internet performance, right now. Cool, right? And it's way more accurate than speedtest.net at predicting your real-life web browsing performance. Although maybe a bit harder to interpret the results. </p><p> (If your RSS reader *doesn't* support iframes, you can visit the app at <a href="https://web.archive.org/web/20170628121952/http://gfblip.appspot.com/">gfblip.appspot.com</a>. Try it on your phone or tablet.) </p><p> For more information, motivation, philosophy, and ranting, read the <a href="https://web.archive.org/web/20170628121952/https://github.com/apenwarr/blip#readme">README</a>. </p><p> And it's open source. Have a nice day. </p><p class="syndicated"><a href="https://web.archive.org/web/20170628121952/http://apenwarr.ca/log/?m=201304#26">Syndicated 2013-04-26 07:01:30 (Updated 2013-04-26 08:02:04) from apenwarr - Business is Programming</a></p></div> </div> <div class="node apenwarr"> <div class="blogdate"><a name="626"><b>13 Jan 2013</b></a> (updated 13 Jan 2013 at 14:02 UTC) <a href="/web/20170628121952/http://www.advogato.org/person/apenwarr/diary/626.html" style="text-decoration: none">»</a></div><div class="content"> <p><b>In which the White House outshines Canadian Politics</b></p> <p> People who are aware of <a href="https://web.archive.org/web/20170628121952/http://apenwarr.ca/log/?m=201102#03">my political view template</a> know that I try to follow a simple process, which is to try to reject low-quality arguments that resort to rhetoric and personal attacks. The result is I sometimes sound like I'm in favour of some policy or motion that I actually disagree with (or vice versa) because I tend to end up arguing about the presentation, and noting the complexity of the problem, rather than just choosing a side and joining the fray. Since I complain that you're being stupid, you assume that I think the opposite point of view is less stupid, but that's missing the point. 
</p><p> In short, I want to see politicians (and politically interested citizens) raising the level of discourse. Having written off American politics long ago, I'm still disappointed when Canadians result to <a href="https://web.archive.org/web/20170628121952/http://apenwarr.ca/log/?m=201001">meaningless sludge</a> instead of stopping to understand what's going on. </p><p> So imagine my surprise when I discovered an actual U.S. political web site with actual facts and opinions and policy statements from an actual political party, responding to questions from actual citizens in the hope of raising the level of discourse. </p><p> The web site I'm referring to is the <a href="https://web.archive.org/web/20170628121952/https://petitions.whitehouse.gov/petitions">whitehouse.gov online petition system</a>. In short, they promise to have some senior policymaker respond to your petition, no matter how stupid, if you can get at least 25,000 people to online-sign it. (25,000 is roughly 0.008% of the population of the United States, so that seems reasonable to me to get the attention of a high-level executive.) </p><p> Note what they promise: not that they'll change anything, or that the president itself will read your message, or that the response will be <i>useful</i>. Just that they'll respond, and the response will come from some actual person that matters. The content of the response, well, you'll have to judge that for yourself. </p><p> (This reminds me of the rules for <a href="https://web.archive.org/web/20170628121952/http://www.parl.gc.ca/MarleauMontpetit/DocumentViewer.aspx?Sec=Ch22&Seq=3&Language=E">petitioning the Government of Canada</a>, except doing that only needs 25 signatures instead of 25,000. On the other hand, you're only guaranteed your petition will be <i>read</i> in parliament, and you probably won't get any response at all, other than the hope they might be thinking about it.) </p><p> So, how does it turn out? 
Well, I read through a few of the responses. Apparently there are 96 existing responses, which seems like a good number to me: it means the filter is blocking out the idiotic petitions (and oh boy, idiotic ones exist) but not just silencing everybody (the total number of responses is bigger than I want to read). Moreover, they sometimes combine multiple related petitions into one response (even if each one has less than 25,000 votes) and sometimes respond to petitions with less than 25,000 even though they didn't promise to do so. That tells me real people are actually reading <i>all</i> the petitions and looking for input, even though they don't have to. Moreover, there are less than 40 petitions open right now with more than 25,000 votes and no responses. Since that's less than half the total responses, that suggests to me that there's simply a time delay to answer them (which I'd expect), not that they don't take it seriously. And I doubt they're just deleting petitions they don't like, since anything that managed to get 25,000 signatures would obviously generate a major internet fuss if the signees found it missing. </p><p> So yes, the 25,000 signature threshold works, the accountability works, the promises are being kept, and there are actual answers up there. </p><p> Are the answers partisan? Of course, they're written by a political party. Are they all satisfying? No, sometimes they just avoid the question and don't bother to back up their claims, like the <a href="https://web.archive.org/web/20170628121952/https://petitions.whitehouse.gov/response/response-we-people-petition-abolishment-transportation-security-administration">Transportation Security Administration</a> one. (On the other hand, the petition itself wasn't so hot either.) </p><p> But what I <i>do</i> see is a real effort to respond in a way that really represents what the administration believes. 
You might not like the TSA response, but after reading it, you know exactly what their policy is about it. There are also things like the several <a href="https://web.archive.org/web/20170628121952/https://petitions.whitehouse.gov/response/removing-bottlenecks-visa-process">immigration reform responses</a> that are ultra-clear about the policy and beliefs - while admitting that, well, you kinda came to the wrong place, because the President isn't the one who sets the immigration policy. </p><p> Even the ones with a "blame the republicans" section, like the <a href="https://web.archive.org/web/20170628121952/https://petitions.whitehouse.gov/response/doubling-and-tripling-what-we-can-accomplish-space">NASA funding response</a>, do it pretty respectfully. They say "unfortunately, not everyone is supportive" and explain some problems of the alternative policy, but they do it with a tone that it encourages you to think about, and maybe talk to, your representatives to see if you can change their minds. They <i>don't</i> start from the assumption that the alternative viewpoint is idiotic and the only solution is the vote them the hell out. I can respect that. </p><p> Canada should have this. The U.S. House and Senate should have this, or at least the Democrats and the Republicans. You know what would be cool? If every party, not just the one in power, submitted a response to every petition that got 25,000 votes, to make their position clear, and we could read them side by side and decide what we believe. And if they could refrain from personal attacks and stick to the issues, like the current site does, and campaigns and TV debates generally don't. </p><p> That would be progress. 
</p><p class="syndicated"><a href="https://web.archive.org/web/20170628121952/http://apenwarr.ca/log/?m=201301#13">Syndicated 2013-01-13 13:08:50 (Updated 2013-01-13 14:02:10) from apenwarr - Business is Programming</a></p></div> </div> <div class="node apenwarr"> <div class="blogdate"><a name="625"><b>29 Dec 2012</b></a> <a href="/web/20170628121952/http://www.advogato.org/person/apenwarr/diary/625.html" style="text-decoration: none">»</a></div><div class="content"> <p><b>3D Printing</b></p> <p> My first 3d-printed creation (and my <a href="https://web.archive.org/web/20170628121952/http://www.3dtin.com/r12w">3d model</a> that I printed it from). The photo below is three printings of the same design at different sizes: </p><p> <img src="https://web.archive.org/web/20170628121952im_/http://apenwarr.ca/diary/cars.jpg"/></p><p> The entire car prints, bottom to top, as a single run, and yes, those wheels actually turn. Each two wheel + axle combination is a single solid object, and the frame between the two actually has closed loops around the axles. So we have the magical-seeming trick of passing the solid axles through the loops - without needing any welding or gluing after the fact. Print it, unstick it, and roll it off the platform. </p><p> Playing with this has really connected a few physical-world concepts that didn't click for me before. For example, measurement tolerances are absolutes, not percentages. The printer I used is accurate to about 0.1mm. Previously, I had never cared about any distances less than a millimeter ("If it ain't on my ruler, it don't exist") but at this scale (the smallest car is only 1cm tall), it matters. To make the loops that wrap around the axle so they don't blur into the axle itself, I had to leave quite a bit of extra space. This produced a tight fit for the smallest car, but when we naively scale up the design linearly (the big one is about 3cm tall) that excess space leaves the axles pretty floppy. 
(Or we can call it 4-wheel steering, and then it's a feature.) </p><p> The other scaling-related lesson comes back to that <a href="https://web.archive.org/web/20170628121952/http://online.wsj.com/article/SB10001424052970204552304577112522982505222.html">old interview question</a>: if somehow you were shrunk down to the size of a coin and put inside a blender, how would you escape? </p><p> The answer is that you'd jump out. Why? Because your body's mass scales down with n^3, but the strength in your muscles scales down much more slowly - let's say n^2. Relative to your size, you'd have super strength. That's why grasshoppers can easily leap to 100x their height, but nothing the size of a human can do so. </p><p> For the same reason, when you drop my big car on the floor, a wheel tends to break off. When you drop the little one on the floor, it stays intact. Why? Because the mass (and thus the force with which it hits the floor) has scaled up by 3^3 = 27, but the tensile strength is only about 3x higher. That's enough to break the rather weak connection between axle and wheel. That could presumably be fixed by better engineering, but I'm not really That Kind of Engineer. </p><p> In short, 3D printing is fun. But really it just makes me that much more impatient for nanobots. Ooh, micron-scale accuracy and toy cars made of diamonds! 
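</p><p> The scaling arithmetic above can be sketched in a few lines of Python. The function name <tt>scale_ratios</tt> and its <tt>strength_exponent</tt> parameter are invented for this illustration, and the exponents themselves are assumptions: the blender argument guesses n^2 for strength, while the "about 3x" figure for the car corresponds to an exponent of 1. </p>
<pre>
```python
def scale_ratios(n, strength_exponent=2):
    """Return (mass_ratio, strength_ratio, strength_per_mass) for a linear scale factor n.

    Assumes mass grows as n**3 and strength as n**strength_exponent.
    Both exponents are rough modelling assumptions, not measurements.
    """
    mass = n ** 3
    strength = n ** strength_exponent
    return mass, strength, strength / mass


# The big car is about 3x the height of the small one.
print(scale_ratios(3, strength_exponent=1))  # strength "about 3x", as in the post
print(scale_ratios(3, strength_exponent=2))  # strength proportional to cross-section
```
</pre>
<p> Either way the mass grows faster than the strength, so the strength-to-mass ratio drops as the car gets bigger. Run it the other way (a scale factor below 1) and the ratio rises, which is the blender answer. 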
</p><p class="syndicated"><a href="https://web.archive.org/web/20170628121952/http://apenwarr.ca/log/?m=201212#29">Syndicated 2012-12-29 11:29:01 from apenwarr - Business is Programming</a></p></div> </div> <div class="node apenwarr"> <div class="blogdate"><a name="624"><b>18 Dec 2012</b></a> <a href="/web/20170628121952/http://www.advogato.org/person/apenwarr/diary/624.html" style="text-decoration: none">»</a></div><div class="content"> <p><b>Programming inside the URL string</b></p> <p> <a href="https://web.archive.org/web/20170628121952/http://afterquery.appspot.com/help">Afterquery</a> is hard to explain to people, possibly because it actually combines several pretty unusual concepts. A single unusual concept is bad enough, but several at once is likely to just leave you scratching your head. With that in mind, here's just one unusual concept: a programming language designed for URLs. Not a language for manipulating URLs; the language *is* the URL. </p><p> In afterquery, if I write this:</p><pre> http://afterquery.appspot.com?url=example.json&group=a,b,c&group=a,b </pre> <p> Then (assuming a..d are strings and e is numeric) it's about equivalent to this SQL:</p><pre> select a, b, count(c) as c, sum(d) as d, sum(e) as e from ( select a, b, c, count(d) as d, sum(e) as e from example group by a, b, c ) group by a, b </pre> <p> This gives, for each combination of a and b, the number of distinct values of c, the number of distinct combinations of (c,d), and the sum of e - each a slightly different useful aggregate. </p><p> Some people have used SQL for so long now that they don't remember anymore exactly how redundant the language is. The above SQL mentions 'e' 4 times! The afterquery code doesn't mention it at all; if you say you want to group by a, b, and c, then the assumption is that e (one of the columns in the initial dataset) is an aggregate value field and doesn't need to be mentioned unless you want an unusual aggregate. 
(The default aggregate is count() for non-numeric fields, and sum() for numeric fields.) </p><p> The sequence of clauses in SQL is also problematic because it's both arbitrary and restrictive. It doesn't reflect the order of operations; in reality, among other things, "from" obviously comes before "select." (Incidentally, LINQ in C# puts "from" first, for that reason.) Worse, "group by" is highly related to the select operation - whether or not something must be aggregated depends on whether it's in the "group by" clause or not, and yet "group by" is way down in the query while "select" is at the top. And worst of all, the entire "group by" clause is actually redundant: you could calculate the "group by" clause entirely by looking at which fields in the select contain aggregation functions and which don't. You know this is true, because if your select clause doesn't use aggregation functions for exactly the right fields (no more, and no less), then your query will abort with an error. </p><p> There's a lot of danger in trying to make a programming language that reads too much like English. Maybe it can be done, but you have to be tasteful about it. SQL is not tasteful; it's designed with the same mindset that produced COBOL. (The joke is that an object oriented COBOL would be called "ADD 1 TO COBOL GIVING COBOL", which is the COBOL equivalent of C++ or [incr tcl]. That line is the actual code for incrementing a number in COBOL. Compare with "SELECT sql+1 FROM sql WHERE sql=0 GROUP BY sql".) </p><p> Still though, despite SQL's insulting repetitiveness, it has one even more depressing <i>advantage:</i> it's still the most concise way to express a database query of moderate complexity. 
C# LINQ is the closest thing we have to a competitor - it was specifically designed to try to replace SQL because coding database queries in an imperative language is too messy - and our above nested grouping looks something like this (based on their <a href="https://web.archive.org/web/20170628121952/http://code.msdn.microsoft.com/101-LINQ-Samples-3fb9811b">sample code</a> - I haven't used LINQ in a while and haven't tested it):</p><pre> var tmp = from x in example group x by new { x.a, x.b, x.c } into g select new { a = g.Key.a, b = g.Key.b, c = g.Key.c, d = g.Count(p => p.d != null), e = g.Sum(p => p.e) }; var result = from r in tmp group r by new { r.a, r.b } into g select new { a = g.Key.a, b = g.Key.b, c = g.Count(p => p.c != null), d = g.Sum(p => p.d), e = g.Sum(p => p.e) }; </pre> <p> That mentions 'e' 4 times, like the SQL does, but also introduces new temporary variables x, g, r, and p. g is itself a magical complex object that includes a "Key" member we have to use, among other things. Now, LINQ is also much more powerful than SQL (you can use it for things other than databases) and in many cases it can translate itself into SQL (so it can query databases efficiently even though you wrote it imperatively), so it has redeeming features. But it definitely hasn't shortened our basic SQL query. </p><p> There are also ORMs (Object-Relational Mappers) out there, like ActiveRecord for example, which can be more concise than plain SQL. But they can't represent complicated concepts all the way through to the database. Generally you end up either downloading all the data and filtering it client-side, or fetching one record at a time (leading to high latency), or splitting the work into two operations: one to get a list of keys, and another to fetch a bunch of keys in parallel. A proper "query language" like SQL doesn't require that kind of hand optimizing. </p><p> Somehow SQL has held on, since 1974, as still the best way to do that particular thing it does. You've got to give them some credit for that. 
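</p><p> To make the nested grouping concrete, here is a rough Python sketch of the same two-step aggregation. The <tt>group_by</tt> helper and the sample rows are hypothetical, written only for illustration (this is not afterquery's code); it assumes the default rule described above, count() for non-numeric columns and sum() for numeric ones: </p>
<pre>
```python
from numbers import Number


def group_by(rows, keys):
    """Group a list of dicts by `keys` and aggregate every other column.

    Assumed default aggregate: sum() for numeric values,
    count() for everything else.
    """
    groups = {}
    for row in rows:
        key = tuple(row[k] for k in keys)
        if key not in groups:
            groups[key] = dict(zip(keys, key))
        out = groups[key]
        for col, val in row.items():
            if col in keys:
                continue
            if isinstance(val, Number):
                out[col] = out.get(col, 0) + val  # sum()
            else:
                out[col] = out.get(col, 0) + 1    # count()
    return list(groups.values())


# Made-up sample data: a..d are strings, e is numeric.
rows = [
    {"a": "x", "b": "y", "c": "c1", "d": "d1", "e": 1},
    {"a": "x", "b": "y", "c": "c1", "d": "d2", "e": 2},
    {"a": "x", "b": "y", "c": "c2", "d": "d3", "e": 4},
]

step1 = group_by(rows, ["a", "b", "c"])   # like group=a,b,c
result = group_by(step1, ["a", "b"])      # like group=a,b
print(result)  # [{'a': 'x', 'b': 'y', 'c': 2, 'd': 3, 'e': 7}]
```
</pre>
<p> Like the afterquery version, the two grouping calls never name d or e: the column types pick the aggregate. 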
</p><p> <b>Imperative or not?</b> </p><p> SQL is a programming language, although a restricted one. In my mind, I like to designate programming languages as one of three types: imperative, functional, or declarative. If you read the official definitions, functional languages are technically a subset of declarative ones, but I usually find those definitions to be more misleading than useful. HTML is a declarative language; LISP is a functional language. They're different, even if they do share underlying mathematical concepts. </p><p> Other declarative non-functional languages include CSS, XML, XQuery, JSON, regular expressions, Prolog, Excel formulas, and SQL. We can observe that declarative languages are a pretty weird bunch. They tend to share a few attributes: that it's hard to predict what the CPU will actually do (so performance depends on external knowledge of how a particular interpreter works), that expressing data works well and expressing commands works badly (I'm talking to you, ANT and MSBuild), and that explicit conditionals always look funny if they're supported at all. </p><p> These problems are very clear in the case of SQL, after using it for only a little while. The hardest part of SQL to understand is its interaction with the so-called "query optimizer" which decides what database indexes to use, and most importantly, when to use an index and when to use a full table scan. The person writing a SQL query will generally have a really clear idea when a table scan is appropriate (that is, usually for small amounts of data) and when it isn't, but there's no way in SQL to express that; you just have to trust the optimizer. It'll generally work, but sometimes it'll go crazy, and you'll have no idea what just happened to make your query go 100x slower. SQL suffers from a definition of "correctness" that doesn't include performance. 
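</p><p> Many databases will at least tell you what the optimizer decided, even if you can't tell it what to do. A minimal sketch using SQLite through Python's built-in <tt>sqlite3</tt> module (SQLite, the table layout and the index name <tt>idx_a</tt> are all assumptions chosen for illustration, and the exact wording of the plan text varies between SQLite versions): </p>
<pre>
```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE example (a TEXT, b TEXT, e INTEGER)")
conn.execute("CREATE INDEX idx_a ON example (a)")


def plan(sql):
    """Return the planner's own description of how it would run `sql`."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql).fetchall()
    # The human-readable detail is the last column of each plan row.
    return " | ".join(row[-1] for row in rows)


indexed = plan("SELECT e FROM example WHERE a = 'x'")
unindexed = plan("SELECT e FROM example WHERE b = 'y'")
print(indexed)
print(unindexed)
```
</pre>
<p> The first query should report a search using <tt>idx_a</tt> and the second a full scan, but note that nothing in either SQL statement asks for one or the other. 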
</p><p> Declarative (and functional) languages also share a major advantage over imperative languages, which is that it's easy to manipulate and rearrange the program without losing its meaning, exactly because the implementation is left unspecified. For example, you should be able to convert an arbitrary SQL query to a map/reduce operation. Declarative and functional languages make things like parallelism easier to implement and reason about. (The "map" and "reduce" operations in map/reduce come from functional programming, of course.) </p><p> Let's look at afterquery again. The above afterquery, which matches the functionality of the above SQL query, can be broken down like this:</p><pre> url=example.json group=a,b,c group=a,b </pre> <p> Is it imperative, declarative, or functional? I've been thinking about it for a couple of weeks, and I don't really know. The fact that it can be mapped directly to a (declarative) SQL query suggests its declarative nature. But having written the implementation, I also know that how it works is very clearly imperative. It ends up translating the query to almost exactly this in JavaScript:</p><pre> grid = fetchurl('example.json'); grid = groupby(grid, ['a', 'b', 'c']); grid = groupby(grid, ['a', 'b']); </pre> <p> That is, it's applying a series of changes to a single data grid (global shared state - the only state) in memory. You might notice that all the above imperative commands, though, have the same structure, so you could write the same thing functionally:</p><pre> groupby( groupby( fetchurl('example.json'), ['a', 'b', 'c']), ['a', 'b']) </pre> <p> Most imperative programs cannot be so easily mapped directly onto pure functional notation. So that leads me to my current theory, which is that afterquery really is an imperative language, but it happens to be one so weak and helpless that it can't express complex concepts (like loops) that would make it incompatible with functional/declarative interpretation. 
It's not Turing-complete. </p><p> Nevertheless, the imperative-<i>looking</i> representation makes it easier to write and debug queries, and to estimate their performance, than declarative-looking SQL. In theory, a sequence of groups and pivots could be rearranged by a query optimizer to run faster or in parallel, but in practice, afterquery's goal of working on small amounts of data (unlike SQL, which is intended to run on large amounts) makes an optimizer pretty unnecessary, so we can just execute the program as a series of transformations in the order they're given. </p><p> <b>An imperative language in the URL string</b> </p><p> Traditionally, URLs are about as declarative as things come. At one level, they are just opaque string parameters to one of a very few functions (GET, POST, PUT, etc.), some of which take an even bigger opaque string (the POST data) as a second parameter. </p><p> One level deeper, we know that URL strings contain certain well-known traditional components: protocol, hostname, path, query string (?whatever), anchor string (#whatever). Inside the query string (and sometimes the anchor string), we have key=value pairs separated by &, with special characters in the values traditionally encoded in a particular way (%-encoding). HTTP specifies the components, but it doesn't have to say anything about the structure of the query string, its key=value pairs, the & signs, or its %-encoding. </p><p> Afterquery uses the same key=value pairs as any query string, but while most apps treat them as a declarative dictionary of essentially unordered key=value pairs - with the only ordering being multiple instances of the same key - afterquery also depends on the ordering of keys. <tt>&group=a,b,c&pivot=a,b;c</tt> is a totally different command from the other way around. </p><p> Another huge constraint on a URL is its length: there is no predefined maximum length, but many browsers limit it, maybe to 1024 characters or so. 
Thus, if you want to keep the program stateless (no state stored between executions, and no programs stored on the server), it's important to keep things concise, so you can say what you need to say inside a single URL. </p><p> Luckily, sometimes the best art comes from constraints! Although Perl-style punctuation-happy languages with lots of implicit arguments are unfashionable nowadays, we have no choice but to adopt them anyway if we want things to fit inside a URL. Our example afterquery is actually equivalent to:</p><pre> url=http://afterquery.appspot.com/example.json group=a,b,c;count(d),sum(e) group=a,b;count(c),sum(d),sum(e) </pre> <p> The URL has a default path based on the script location (as URLs always do), and the semicolon in group= statements separates the keys from the values. That leads to the very confusing at first, but very convenient thereafter, distinction between</p><pre> group=a,b,c </pre> <p> and</p><pre> group=a,b,c; </pre> <p> which mean very different things: the first means "keep all the other columns, and guess how to aggregate their values" while the second means "throw away all the other columns." </p><p> Like a regex, it's totally unfriendly to an initial observer, where an SQL statement (or COBOL program) might make a beginner feel comfortable that they can read what's going on. But my theory is: once you've got the idea, SQL is just tedious, but afterquery is more fun. </p><p> And unlike SQL, a nontrivial program will fit in a URL string. 
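</p><p> The order dependence is easy to demonstrate with Python's standard library: <tt>urllib.parse.parse_qsl</tt> returns the key=value pairs as a list in URL order, which is what an ordered command sequence needs. The <tt>steps</tt> helper below is hypothetical and is not how afterquery itself parses URLs: </p>
<pre>
```python
from urllib.parse import parse_qsl, urlsplit


def steps(url):
    """Return the query string as an ordered list of (command, argument) pairs."""
    return parse_qsl(urlsplit(url).query, keep_blank_values=True)


# The example URL from the post.
url = "http://afterquery.appspot.com?url=example.json&group=a,b,c&group=a,b"
for command, argument in steps(url):
    print(command, argument)
```
</pre>
<p> A dictionary-style parser keeps the order within one key but forgets how different keys (say group= and pivot=) were interleaved, which is exactly the information afterquery needs. 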
</p><p class="syndicated"><a href="https://web.archive.org/web/20170628121952/http://apenwarr.ca/log/?m=201212#18">Syndicated 2012-12-16 04:46:12 from apenwarr - Business is Programming</a></p></div> </div> <p><a href="/web/20170628121952/http://www.advogato.org/person/apenwarr/diary.html?start=623">624 older entries...</a></p> </div></div></div> </body> </html>