[hts-users:01924] Parallel training by HERest
- Subject: [hts-users:01924] Parallel training by HERest
- From: tshlmail-hts@xxxxxxxxx
- Date: Wed, 1 Apr 2009 15:20:12 -0700 (PDT)
- Delivered-to: hts-users@xxxxxxxxxxxxxxx
- Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=yahoo.com; s=s1024; t=1238624412; bh=SBnEG2yvVU1ASaflsk/8RdrCEV03XF010W9X+ul4gg8=; h=Message-ID:X-YMail-OSG:Received:X-Mailer:Date:From:Reply-To:Subject:To:MIME-Version:Content-Type:Content-Transfer-Encoding; b=Os1M5u+mZUicjiVYbMzAyxzGGv8vCtdCSeuw606kDv6jsOmETU4JUyJ718EanSMre3dNNYP+vkd5lNp3U2Qz+070b5bJtpZdComET5JXFF6ve4/R7gallb1hSjUe2cKUdRQlYsr9iOd8sNnrXsv8H/pxIaZXf8GSg/4okxBmVrw=
- Domainkey-signature: a=rsa-sha1; q=dns; c=nofws; s=s1024; d=yahoo.com; h=Message-ID:X-YMail-OSG:Received:X-Mailer:Date:From:Reply-To:Subject:To:MIME-Version:Content-Type:Content-Transfer-Encoding; b=s/eKhJAZ35wttCU1hCZcw6+64mPJaL/xEWQln8ul9JLyLOjx9FzCBD4gzlb1XsL1L5Z5ZpQfSCoTqVgWiZTgZRsvsLv+ynNkzWyJ7yhLr4IgoPjY7vdIT6fz5nuXBHv/84sB+1MleVvhhqGQEShoIz/gDMjO4fp7brZ/pjwpAFI=;
As far as I know, HERest supports parallel training as follows:
# non-parallel:
HERest -S trlist -I labs -H dir1/hmacs -M dir2 hmmlist
# parallel (equivalent to the above command)
HERest -S trlist1 -I labs -H dir1/hmacs -M dir2 -p 1 hmmlist # Part 1
HERest -S trlist2 -I labs -H dir1/hmacs -M dir2 -p 2 hmmlist # Part 2
HERest -H dir1/hmacs -M dir2 -p 0 hmmlist dir2/*.acc # Merging
It is obvious that only the training set is split.
My current problem is that I have a huge model file (i.e. dir1/hmacs being huge) and insufficient memory. So I am going to split the huge model file as well into small ones, such that each of them corresponds to a partial training list (i.e. trlist*). In other words, I wonder if the following commands work:
HERest -S trlist1 -I labs1 -H dir1/hmacs1 -M dir2 -p 1 hmmlist1 # Part 1
HERest -S trlist2 -I labs2 -H dir1/hmacs2 -M dir2 -p 2 hmmlist2 # Part 2
HERest -H dir1/hmacs -M dir2 -p 0 hmmlist dir2/*.acc # Merging
I tried this way today but got an error numbered +7150. I am not sure whether HERest supports splitting not only a training set but also a corresponding model file.
By the way, my training set cannot be split into two smaller parts between which there is no label overlap.
Thank you very much!
Hui LIANG
- Follow-Ups
-
- [hts-users:01925] Re: Parallel training by HERest, Heiga ZEN (Byung Ha CHUN)