**Salmon Run: Learning Mahout : Classification**

'via Blog this'

**Salmon Run: Learning Mahout : Classification**

'via Blog this'

**MpichCluster - Community Ubuntu Documentation**

Hello from processor 0 of 8

Hello from processor 1 of 8

Hello from processor 2 of 8

Hello from processor 3 of 8

Hello from processor 4 of 8

Hello from processor 5 of 8

Hello from processor 6 of 8

Hello from processor 7 of 8"

'via Blog this'

**George Dyson - President's Distinguished Lecture - Honorary Doctor of Laws - University of Victoria**

Location: University Centre Farquhar Auditorium

Ticket Information: Free admission and everyone welcome. Tickets must be reserved in advance. For ticket inquiries call 250-721-8480 or visit auditorium.uvic.ca."

'via Blog this'

**Seattle**

Seattle operates on resources such as laptops, servers, and phones, which are donated by users and institutions. The global distribution of the Seattle network provides the ability to use it in application contexts that include cloud computing, peer-to-peer networking, ubiquitous/mobile computing, and distributed systems."

'via Blog this'

**RepyTutorial – Seattle**

It is assumed that you have a basic understanding of network programming such as socket, ports, IP addresses, and etc. Also, a basic understanding of HTML is useful but not required. Lastly, you need a basic understanding of the Python programming language. If not, you might want to first read through the Python tutorial at http://www.python.org/doc/ or the python tutorial in this site. You do not need to be a Python expert to use Repy, but as Repy is a subset of Python, being able to write a simple Python program is essential.

"

'via Blog this'

**Newton Institute Seminar : van Houwelingen, JC, 17/06/2008**

'via Blog this'

**Logistic Regression**

Logistic regression is a model used for prediction of the probability of occurrence of an event. It makes use of several predictor variables that may be either numerical or categories.

Logistic regression is the standard industry workhorse that underlies many production fraud detection and advertising quality and targeting products. The Mahout implementation uses Stochastic Gradient Descent (SGD) to all large training sets to be used.

For a more detailed analysis of the approach, have a look at the thesis of Paul Komarek:

http://www.autonlab.org/autonweb/14709/version/4/part/5/data/komarek:lr_thesis.pdf?branch=main&language=en

See MAHOUT-228 for the main JIRA issue for SGD.

"

'via Blog this'

**Logistic**

In order to find the matrix B for which L is minimised, a Quasi-Newton Method is used to search for the optimized values of the m*(k-1) variables. Note that before we use the optimization procedure, we 'squeeze' the matrix B into a m*(k-1) vector. For details of the optimization procedure, please check weka.core.Optimization class.

Although original Logistic Regression does not deal with instance weights, we modify the algorithm a little bit to handle the instance weights.

For more information see:

le Cessie, S., van Houwelingen, J.C. (1992). Ridge Estimators in Logistic Regression. Applied Statistics. 41(1):191-201."

'via Blog this'

**Logistic**

There are some modifications, however, compared to the paper of leCessie and van Houwelingen(1992):

If there are k classes for n instances with m attributes, the parameter matrix B to be calculated will be an m*(k-1) matrix.

"

'via Blog this'

**WEKA - Convert from arff to csv from command line?**

'via Blog this'

java -Xmx1500m -classpath /usr/share/java/weka.jar weka.core.converters.CSVSaver -i test.arff -o test.csv

**New HTML Parser**: The long-awaited libxml2 based HTML parser
code is live. It needs further work but already handles most
markup better than the original parser.

Keep up with the latest Advogato features by reading the Advogato status blog.

If you're a C programmer with some spare time, take a look at the mod_virgule project page and help us with one of the tasks on the ToDo list!