adubey is currently certified at Journeyer level.

Name: Amit Dubey
Member since: 2000-07-19 03:48:40
Last Login: 2011-01-19 10:38:07

FOAF RDF Share This



I'm working on a homepage, but I have project reports and a short vacation (thankfully!) coming up, so it'll be a couple months before it's ready. About myself: I'm a masters student in artificial intelligence. My thesis concerns (or should I say, "will likely concern?") language acquisition - the idea that it will be possible for computers to understand human language given a sufficiently marked-up training set to learn from.

I'm interested in approaching this problem from a human-computer interaction viewpoint. In other words, rather than make a program try to (poorly?) learn a whole language, instead I hope to learn a "sub-language" that deals with a particular task, and learn that language well.

My other interests, perhaps unsurprisingly, relate to other kinds of languages: programming languages (and type theory), formal languages (and the theory of computation) and statistical languages (and statistical approaches to AI).


Recent blog entries by adubey

Syndication: RSS 2.0

Random thoughts... does anyone else find advogato's interface a bit lacking?

Often, I find I have to click through 2-3 pages to do common things like post a diary entry. In general, I feel like it's harder than it should be to do the things I want to do, or that I do often. While this feeling might go away after I get used to it, perhaps it isn't as welcoming to new users.

Other random thoughts... sometimes people post diary entries that have interesting ideas that are worth discussing. While email is a suitable route for that, wouldn't it be cool if you could reply to those?

Even more random... first there were linear news readers, then there were "threaded" news readers. These are essentially trees, where each node has one parent. Now, the web does threaded discussions one up: we have directed graphs instead of trees. But short of a WikiWikiWeb-type CGI program, the web is geared to "read this" rather than "comment on this". Mightn't it be cool if you had some kind of "threaded" system where posts may have more than one parent? Someplace where you could bring separate discussions that are mutually relevant together instead of forever splitting them. For example, a discussion hanging off an article with one hanging off a diary entry. Ah... if I only I had the time... anyways, I'm getting far too random for my own good.

Anyways, my prob parser is stuck right now; the training data is in an ass-backwards format in which words aren't necessarily given a part-of-speech. In other words, rules are in the form Nonterminal->(Terminal|Nonterminal)* rather than Nonterminal->Terminal* | Nonterminal->Nonterminal*. Of course, I could split things up myself (by putting the grammar in CNF), but then there I will loose some generalizations (ie in AP-> NP and NP | AP-> VP and VP NP, each "and" will get a different non-terminal, but I want only one. Sometimes the POS will be different, so I can't say "always make 'and' an adjunct".) Grr... I could use a part-of-speech tagger, but then I have to 1) link the 'C' POS tagger to ML or 2) play around with the training data's nonterminals to be compatible with the tagger's tags. Alternatively, I could shell out $2500 for a good training set...

This is my first diary entry. I've just added three "projects" although I'm only actively working on two of them.

They are Simulus (a game), a really bad first attempt at making a speech interface to a software application - in this case gnumeric (this is the one that I've abandonded for now) and finally, my probabilistic parser.

My short-term goal is getting the parser to a workable state. When that's done I'll probably post another diary entry. I don't intent to touch the game until september.


adubey certified others as follows:

  • adubey certified jrf as Journeyer
  • adubey certified welisc as Journeyer
  • adubey certified mobius as Journeyer
  • adubey certified manu as Journeyer
  • adubey certified nymia as Apprentice
  • adubey certified zeevon as Apprentice
  • adubey certified pjones as Journeyer
  • adubey certified TheCorruptor as Apprentice
  • adubey certified kgb as Master
  • adubey certified adubey as Journeyer
  • adubey certified rw2 as Journeyer
  • adubey certified emk as Journeyer
  • adubey certified bownie as Journeyer

Others have certified adubey as follows:

  • jlf certified adubey as Apprentice
  • welisc certified adubey as Journeyer
  • nixnut certified adubey as Apprentice
  • adubey certified adubey as Journeyer
  • rw2 certified adubey as Apprentice
  • jrf certified adubey as Journeyer
  • TheCorruptor certified adubey as Journeyer
  • jLoki certified adubey as Journeyer

[ Certification disabled because you're not logged in. ]

New Advogato Features

New HTML Parser: The long-awaited libxml2 based HTML parser code is live. It needs further work but already handles most markup better than the original parser.

Keep up with the latest Advogato features by reading the Advogato status blog.

If you're a C programmer with some spare time, take a look at the mod_virgule project page and help us with one of the tasks on the ToDo list!

Share this page