Changeset 19760ef
- Timestamp:
- 05/15/08 21:55:02 (17 years ago)
- Branches:
- master, help
- Children:
- b012e2a
- Parents:
- 6ac84d8
- git-author:
- obrebski <obrebski@…> (05/15/08 21:55:02)
- git-committer:
- obrebski <obrebski@…> (05/15/08 21:55:02)
- Location:
- app
- Files:
-
- 7 edited
Legend:
- Unmodified
- Added
- Removed
-
app/TODO
r6ac84d8 r19760ef 2 2 * przemyslec sposob wybierania jezyka / slownika po zainstalowaniu roznych dystrybucji [PK, TO] 3 3 * gue nie sortuje wynikow, opcja weights dziala na odwrot 4 * kor nie wykonuje zamian <jednalitera> -> <dwielitery>, np. Ō rz 4 5 5 6 WAZNE: -
app/conf/kor.conf
radb4c8d r19760ef 14 14 weights = PATH_PREFIX/share/utt/weights.kor 15 15 threshold = 1.0 16 process=W -
app/conf/ser.conf
radb4c8d r19760ef 13 13 macros = PATH_PREFIX/lib/utt/terms.m4 14 14 flex-template = PATH_PREFIX/lib/utt/ser.l.template 15 tags=uam -
app/doc/utt.texinfo
r246900a r19760ef 11 11 This manual is for UAM Text Tools (version 0.90, November, 2007) 12 12 13 Copyright @copyright{} 2005, 2007 Tomasz Obr êbski, Micha³ Stolarski, Justyna Walkowska, Pawe³ Konieczka.13 Copyright @copyright{} 2005, 2007 Tomasz Obrêbski, Micha³ Stolarski, Justyna Walkowska, Pawe³ Konieczka. 14 14 15 15 Permission is granted to copy, distribute and/or modify this document … … 128 128 @item Marcin Walas 129 129 @item Justyna Walkowska 130 @item PaweÅ WereÅski 130 131 @end itemize 131 132 … … 249 250 @example 250 251 0000 00 BOS * 251 0000 07 W Piszemy lem:pisa æ,V252 0000 07 W Piszemy lem:pisaÊ,V 252 253 0007 01 S _ 253 254 0008 05 W dobre lem:dobry,ADJ … … 260 261 0024 11 W Warszawiacy lem:Warszawiak,N 261 262 0035 01 S _ 262 0036 03 W te ¿263 0036 03 W te¿ 263 264 0039 01 P . 264 265 0040 00 EOS * … … 268 269 @example 269 270 0000 BOS * 270 0000 W Piszemy lem:pisa æ,V271 0000 W Piszemy lem:pisaÊ,V 271 272 0007 S _ 272 273 0008 W dobre lem:dobry,ADJ … … 281 282 @example 282 283 0000 BOS * 283 W Piszemy lem:pisa æ,V284 W Piszemy lem:pisaÊ,V 284 285 S _ 285 286 W dobre lem:dobry,ADJ … … 292 293 W Warszawiacy lem:Warszawiak,N 293 294 S _ 294 W te ¿295 W te¿ 295 296 P . 296 297 EOS * … … 406 407 407 408 408 @c [JAK UZYSKA ÆPOLSKIE CZCIONKI W DVI???]409 @c [JAK UZYSKAà POLSKIE CZCIONKI W DVI???] 409 410 410 411 @macro parhelp … … 719 720 720 721 @multitable {aaaaaaaaaaaaaaaaaaaaaaaaa} {aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa} 721 @item @strong{Authors:} @tab Tomasz Obr êbski722 @item @strong{Authors:} @tab Tomasz Obrêbski 722 723 @item @strong{Component category:} @tab source 723 724 @end multitable … … 821 822 @c @chapter sen - sentencizer 822 823 823 @c Authors: Tomasz Obr êbski824 @c Authors: Tomasz Obrêbski 824 825 825 826 @c --------------------------------------------------------------------- … … 832 833 833 834 @multitable {aaaaaaaaaaaaaaaaaaaaaaaaa} {aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa} 834 @item @strong{Authors:} @tab Tomasz Obr êbski, Micha³ Stolarski835 @item @strong{Authors:} @tab Tomasz Obrêbski, Micha³ Stolarski 835 836 @item @strong{Component category:} @tab filter 836 837 @end multitable … … 933 934 934 935 @example 935 0000 07 W Piszemy lem:pisa æ,V/AiVpMdTrfNpP1936 0000 07 W Piszemy lem:pisaÊ,V/AiVpMdTrfNpP1 936 937 0007 01 B _ 937 938 0008 05 W dobre lem:dobry,ADJ/DpNpCnavGaifn … … 948 949 949 950 @example 950 0000 07 W Piszemy lem:pisa æ,V/AiVpMdTrfNpP1951 0000 07 W Piszemy lem:pisaÊ,V/AiVpMdTrfNpP1 951 952 0007 01 S _ 952 953 0008 05 W dobre lem:dobry,ADJ/DpNpCnavGaifn lem:dobry,ADJ/DpNsCnavGn … … 960 961 961 962 @example 962 0000 07 W Piszemy lem:pisa æ,V/AiVpMdTrfNpP1963 0000 07 W Piszemy lem:pisaÊ,V/AiVpMdTrfNpP1 963 964 0007 01 S _ 964 965 0008 05 W dobre lem:dobry,ADJ/DpNpCnavGaifn,ADJ/DpNsCnavGn … … 994 995 string @code{<add1>}, replace suffix of length @code{<cut2>} with string 995 996 @code{<add2>}. For example @code{3t} transforms @samp{kocie} into 996 @samp{kot}, @code{3-4a ³y} transforms @samp{najbielsi} into @samp{bia³y}997 @samp{kot}, @code{3-4a³y} transforms @samp{najbielsi} into @samp{bia³y} 997 998 998 999 Each dictionary entry must be written in one line and must not contain blank characters. … … 1005 1006 kotem;2,N/GaNsCi 1006 1007 kocie;3t,N/GaNsCl;3t,N/GaNsCv 1007 najbielsi;3-4a ³y,ADJ/DsNpCnGp1008 najbielsze;3-5a ³y,ADJ/DsNpCnGaifn1008 najbielsi;3-4a³y,ADJ/DsNpCnGp 1009 najbielsze;3-5a³y,ADJ/DsNpCnGaifn 1009 1010 najlepsi;dobry,ADJ/DsNpCnGp 1010 1011 najlepsze;dobry,ADJ/DsNpCnGaifn … … 1065 1066 @multitable {aaaaaaaaaaaaaaaaaaaaaaaaa} {aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa} 1066 1067 1067 @item @strong{Authors:} @tab Micha ³ Stolarski, Tomasz Obrêbski1068 @item @strong{Authors:} @tab Micha³ Stolarski, Tomasz Obrêbski 1068 1069 @item @strong{Component category:} @tab filter 1069 1070 … … 1156 1157 1157 1158 1158 Example: @code{3-4a ³y} transforms @i{najbielsi} into @i{bia³y}1159 Example: @code{3-4a³y} transforms @i{najbielsi} into @i{bia³y} 1159 1160 1160 1161 … … 1165 1166 1166 1167 @example 1167 * ³kê;1a,N/GfNsCa1168 naj*elszy;3-4a ³y,ADJ/...:...1168 *³kê;1a,N/GfNsCa 1169 naj*elszy;3-4a³y,ADJ/...:... 1169 1170 @end example 1170 1171 … … 1179 1180 1180 1181 @multitable {aaaaaaaaaaaaaaaaaaaaaaaaa} {aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa} 1181 @item @strong{Authors:} @tab Tomasz Obr êbski, Micha³ Stolarski1182 @item @strong{Authors:} @tab Tomasz Obrêbski, Micha³ Stolarski 1182 1183 @item @strong{Component category:} @tab filter 1183 1184 @end multitable … … 1248 1249 @multitable {aaaaaaaaaaaaaaaaaaaaaaaaa} {aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa} 1249 1250 1250 @item @strong{Authors:} @tab Tomasz Obr êbski1251 @item @strong{Authors:} @tab Tomasz Obrêbski 1251 1252 @item @strong{Component category:} @tab filter 1252 1253 … … 1268 1269 1269 1270 input: 1270 0000 05 W Cze ¶æ1271 0000 05 W Cze¶Ê 1271 1272 0005 01 P ! 1272 1273 0006 01 S _ … … 1279 1280 output: 1280 1281 0000 00 BOS * 1281 0000 05 W Cze ¶æ1282 0000 05 W Cze¶Ê 1282 1283 0005 01 P ! 1283 1284 0006 00 EOS * … … 1300 1301 @c @chapter gph - graphizer 1301 1302 1302 @c Authors: Tomasz Obr êbski1303 @c Authors: Tomasz Obrêbski 1303 1304 1304 1305 … … 1313 1314 1314 1315 @multitable {aaaaaaaaaaaaaaaaaaaaaaaaa} {aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa} 1315 @item @strong{Authors:} @tab Tomasz Obr êbski1316 @item @strong{Authors:} @tab Tomasz Obrêbski 1316 1317 @item @strong{Component category:} @tab filter 1317 1318 @end multitable … … 1541 1542 1542 1543 @multitable {aaaaaaaaaaaaaaaaaaaaaaaaa} {aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa} 1543 @item @strong{Authors:} @tab Tomasz Obr êbski1544 @item @strong{Authors:} @tab Tomasz Obrêbski 1544 1545 @item @strong{Component category:} @tab filter 1545 1546 @end multitable … … 1635 1636 @section kot - untokenizer 1636 1637 1637 Authors: Tomasz Obr êbski1638 Authors: Tomasz Obrêbski 1638 1639 1639 1640 @command{kot} is the opposite of @command{tok}. It changes UTT-formatted text into plain text. … … 1850 1851 1851 1852 @multitable {aaaaaaaaaaaaaaaaaaaaaaaaa} {aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa} 1852 @item @strong{Authors:} @tab Tomasz Obr êbski1853 @item @strong{Authors:} @tab Tomasz Obrêbski 1853 1854 @item @strong{Component category:} @tab filter 1854 1855 @end multitable … … 1889 1890 1890 1891 @multitable {aaaaaaaaaaaaaaaaaaaaaaaaa} {aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa} 1891 @item @strong{Authors:} @tab Tomasz Obr êbski1892 @item @strong{Authors:} @tab Tomasz Obrêbski 1892 1893 @item @strong{Component category:} @tab filter 1893 1894 @end multitable -
app/src/dgp/tre.rb
radb4c8d r19760ef 1 1 #!/usr/bin/ruby -I /usr/local/lib/utt -I $HOME/.local/lib/utt 2 2 3 $: << "#{ENV['HOME']}/.local/lib/utt" 4 $: << "/usr/local/lib/utt" 5 3 6 require 'getoptlong' 7 require 'seg.rb' 4 8 5 9 opts = GetoptLong.new( … … 60 64 end 61 65 end 62 63 #require File.expand_path(File.dirname(__FILE__) + "../lib/utt/seg.rb")64 require 'seg.rb'65 66 66 67 $dgpsep=';' -
app/src/gue/cmdline_guess.ggo
r8d3e6ab r19760ef 8 8 option "dictionary" d "File with dictionary information" string typestr="filename" default="gue.bin" no 9 9 option "per-info" v "Display performance information" flag off 10 option "weights" w "Print weights" flag off hidden10 option "weights" w "Print weights" flag off 11 11 option "no-uppercase" - "Do not process form containing uppercase letters" flag off 12 12 -
app/src/gue/common_guess.cc
r6ac84d8 r19760ef 8 8 char dictionary[255]; 9 9 bool per_info=false; 10 bool weights= true;10 bool weights=false; 11 11 12 12 void process_guess_options(gengetopt_args_info* args) … … 56 56 57 57 if(args->weights_given) 58 weights= false;58 weights=true; 59 59 60 60 }
Note: See TracChangeset
for help on using the changeset viewer.