mots quotidiens.

NL研のスライドが大体できた(←遅いヨ!).
ただ, 言いたいことがないわけじゃないのだけれども, 全然理論的に進歩がないし, ちょっと今回は.. ということで, 聞かなかったことにしてほしい感じ. (本当はキャンセルしたい位だったが, 申し込んだ後だったのでできなかった.)

この1月くらいこの発表があるので憂鬱だったが, これを乗り切れば自分の勉強の続きができる(という自由が奪われていないのは幸せなことである) ので, 何とかこなす予定です.

これだけでは何なので, 先月末に作った, ローカルの *.ps(.gz|Z) をブラウザの Acrobat Reader で変換して読むための簡単な script.
使い方: http://.../cgi-bin/pdfview.cgi?/home/who/fooa.ps.gz
ps2pdfに渡して何か悪さができるとは思えないけど, 一応個人用にしておいた方がいいかも.

#!/usr/local/bin/perl
#
#  PDFview.cgi
#  -- View PostScript papers in PDF on the fly.
#  Sat Dec 28 20:01:53 2002 daiti-m@is.aist-nara.ac.jp
#
use CGI;
my $zcat   = '/bin/zcat';
my $bzcat  = '/usr/bin/bzcat';
my $ps2pdf = '/usr/bin/ps2pdf';
my $cgi = new CGI;
my $file = $cgi->param('keywords');
print "Content-Type: application/pdf\n\n";

if ($file =~ /\.(?:Z|gz)$/) {
    system sprintf("$zcat %s | $ps2pdf - -", $file);
} elsif ($file =~ /\.bz2$/) {
    system sprintf("$bzcat %s | $ps2pdf - -", $file);
} else {
    system sprintf("$ps2pdf %s -", $file);
}

[n年日記]

形態論

Xerox の Lauri Karttunen 氏の "Applications of Finite-State Transducers in Natural-Language Processing".
まだちゃんと中身を読んでいないが, abstract によると

Stump's recent book, INFLECTIONAL MORPHOLOGY (2001), lays out a rich set of principles and notational conventions designed to capture important linguistic generalizations. In spite of the apparent complexity of Stump's formalism, the system as a whole is no more powerful than a collection of regular relations. Consequently, a Stump-style description of the morphology of a particular language such as Lingala or Bulgarian can be compiled into a finite-state transducer that maps the underlying lexical representations directly into the corresponding surface forms or forms, and vice versa, yielding a single lexical transducer.

らしい。いかす.
Karttunen 氏はもともと言語学の出身で, 今は Xerox europe にいるが結構お年を召したおじさんのようだ。ということに結構元気づけられたりして。
Stanford の NLP Reading group のMLのアナウンスより。

2003年01月18日(土) [n年日記]

#1 Slides

#2 -

2003年01月25日(土) [n年日記]

#1 形態論

mots quotidiens.
Daichi Mochihashi (持橋大地) daichi <at> ism.ac.jp	by hns, version 2.10-pl1.