Install another language in deepspeech with node js - node.js

I am trying to use mozilla-deepspeech in my project to generate text to voice. How do i use it for another language than English? I want to convert to Indonesia language.
Note: I am using NodeJS as a base project on windows.
Thank you for your solution

You need an Indonesian model that is downloadable. Check this list.
Simply load another model and scorer via the API and provide the audio.

Related

RASA how to use Japanese (Tokennization-Mecab)

RASA is known to be an effective bots framework.
Stack such as RASA NLU and RASA Core is really useful.
I hand-on it around, I find out that its amazing especially with English text. I give another try on Japanese Text (Alpha support of spacy). I used it with tensorflow pipeline, I got stuck, I cannot figure out how to use external tokenization such as Mecab.
Anyone experience it ??
The tensorflow pipeline works with any language that’s whitespace tokenized. As it is not the case with Japanse, you have to built your own tokenizer.
You can do so by extending the classes Tokenizer and Component, e,g.:
class MecabTokenizer(Tokenizer, Component):
# fill with your code
You can then use your custom class in your NLU pipeline by specifying the module path in the name (also described in the docs), e.g.:
pipeline:
- name: "path.to.MecabTokenizer"
# other components
Somebody tried something similar here, maybe you can use this or take is some sort of template.
I have added a custom component using Mecab tokenizer. It works fine for me for Japanese text.
Link: Rasa_Japanese

I want to use The GATE Predicate-Argument EXtractor Component (PAX)

I want to use The GATE Predicate-Argument EXtractor Component (PAX), but I can't figure out how to load the plugin on GATE developer (ver. 7)
Please help me.
How do I load it?
The MultiPaX plugin is a bit of a complex beast. You need to download the package from the homepage, unpack it (use something like 7-zip if you're on Windows) and then build it using Ant, which you will need to install separately - GATE used to bundle a copy of Ant but that is no longer the case since version 7. Once you have the plugin compiled and packaged you should be able to load it through the plugin manager the same as any other plugin.
However, to get anything useful out of the PR you need to build quite a complex pipeline including at least one of SUPPLE, MiniPar or the Stanford Parser PR. Getting these to work is itself a non-trivial task...
The predicate argument extractor is not developed by the core GATE team, so specific questions are probably better posted in the semanticsoftware.info forum (linked from the bottom of http://www.semanticsoftware.info/pax) rather than the usual GATE user list.

Text to Speech (TTS) software for Scripts WAV or MP3 ouput

I asked this question on SuperUser, but it's fallen on deaf ears. Hopefully I can get more of an audience here.
I'm looking for a low cost (or Free) solution like ScriptVox only with a better engine. That is, to read in a script and assign characters to voice. I've read the post here but even with those I'd have to concatenate wav files. It's not that I don't love Audacity, but it is time consuming. I am halfway thinking of writing my own, but I'm sure there has to be a solution out there. Any suggestions?
I would use Microsoft's Text-to-Speech engine. They have a simple example on how to do exactly what you're looking for:
http://msdn.microsoft.com/en-us/library/ms717065(v=vs.85).aspx
With that sample code, you can speak some text and have it dumped to a WAV file. From there, if you need to convert to a format such as MP3, you can use FFMPEG.
Brad's answer is pretty terrific, as it contains exactly what you're looking for. However, it's missing one fundament you'd expressed a preference for in the question errata: an implementation in C#.
Here's a full tutorial to gain access to the Speech API in managed code. With full credit to Blake Niemyjski and the appropriate teams at Microsoft, here's the salient bits, because the linkback to the original article is dead and this appears to be borrowed from Microsoft directly:
The following link (Giving Computers a Voice) will lead you to a
Microsoft site that will show you how to create a project and get a
basic text to speech application up and running in VB .Net or c# in no
time!
SAPI
SAPI is the speech API that gives applications access to speech
recognition and text-to-speech (TTS) engines. This article focuses on
TTS. For TTS, SAPI takes text as input and uses the TTS engine to
output that text as spoken audio. This is the same technology used by
the Windows accessibility tool, Narrator. Every version of Windows
since XP has shipped with SAPI and an English TTS engine.
TTS puts user's ears to work. It allows applications to send
information to the user without requiring the user's eyes or hands.
This is a very powerful output option that isn't often utilized on
PCs.
Three steps are needed to use TTS in a managed application:
Create an interop DLL
Since SAPI is a COM component, an interop DLL is needed to use it from
a managed app. To create this, open the project in Visual Studio.
Select the Project menu and click Add Reference. Select the COM tab,
select "Microsoft Speech Object Library" in the list, and click OK.
These steps add this reference to your project and create an
Interop.SpeechLib.dll in the same folder as your executable. This
interop DLL must always be in the same folder as your .exe to work
correctly.
Reference the interop namespace
Include this namespace in your application. In C#, add "using
SpeechLib;"; iIn VB, add “Imports SpeechLib”.
call Speak()
Create a SpVoice object and call Speak():
Visual C#
SpVoice voice = new SpVoice();
voice.Speak("Hello World!", SpeechVoiceSpeakFlags.SVSFDefault);
Visual Basic
voice = New SpVoice
voice.Speak("Hello World!", SpeechVoiceSpeakFlags.SVSFDefault)
I feel Brad's answer led me to the correct solution here (thus, he's more deserving of credit than I), but this should be the last piece you were missing. You should now be able to replicate the WAV-file writing from the C++ solution in managed code, and from there, transcode into your desired format.
If having the program access internet is acceptable, then you could use iSpeech.
You can use their API, but unfortunately it is limited to 200 uses/day.
Their API also allows appending format=(wav|mp3) following a query, allowing you to get your sound in both desired formats.
http://en.wikipedia.org/wiki/Comparison_of_speech_synthesizers
That's all I've got.
Google translate uses eSpeak http://support.google.com/translate/

how to create a check box in j2me?

I am developing my first project. I have to create a check box and based on the selection it should give a output.
use "J2ME:Complete Reference". Best book for begineers. as i also started learning J2ME recently,i found it very useful.It is easily available for download here.

Creating PDF Invoices - Are there any templating solutions? [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
Questions asking us to recommend or find a book, tool, software library, tutorial or other off-site resource are off-topic for Stack Overflow as they tend to attract opinionated answers and spam. Instead, describe the problem and what has been done so far to solve it.
Closed 8 years ago.
Improve this question
Our company is looking to integrate invoices into a new system we are developing.
We require a solution to create a layout of the invoice and then convert to pdf.
We have considered just laying out the invoice in html/css then converting to pdf.
We have also considered using SVG->PDf conversion.
Both of these solutions integrate well into our existing templating language used for our web application.
Historically we have been a Microsoft based business and used Crystal Reports for such a task but we are looking for an open source Linux solution for this project.
Does any one have any suggestions of an approach or technology we could use for such a task?
Try this... create a blank invoice with Word (or whatever you want) and save it as a PDF.
Then use a PDF library to modify the PDF (insert the text at particular coordinates). We do this in the Microsoft world and it is extremely easy.
The biggest benefit is that we can use our own tools to create and modify the template. If we want to add some static text, we just crank open Word, make the change and save it to a PDF file (that is being used as a template).
For Microsoft, we use iTextSharp which is actually a C# port of the original Java version of iText
Additionally...
You can use Adobe Acrobat to insert fields in the PDF (address, phone, invoice number, line item 1, line item 2, etc...) and then use iText/iTextSharp to populate these fields at run time.
This is, in more detail, what we do... and it is extremely easy.
The normal way is to install (La)TeX (probably already on the linux box) and run pdflatex to get the pdfs. You can also use Apache FOP, if you prefer xslt and xsl-fo.
If the number of invoices to create is low you might want to use open-office (directly or as a toolkit).
If you want high-precision positioning and low-level access, a low-level pdf library (I don't know if iTextSharp works with mono) might be what you want.
I would try out LaTeX first, because it allows you to get results with the least effort.
I've previously produced invoices by templating a PostScript file, and then using Ghostscript's ps2pdf to convert those into PDFs.
We use Reportlab with Python. If you look around there are a load of ready-made forms/invoices/etc.
There are several OSS reporting engines (Jasper Reports, Pentaho and BIRT to name three) that you could use in much the same way as you have historically used Crystal Reports. One of the other posters mentions ReportLab, which is an option if you're using Python or can embed a Python runtime in your application.
Probably the most flexible solution is to create XMLs with invoice data and then by using XSLTs transform the, into PDFs, HTMls, whatever...
It depends on your environment. If you have access to Java, you might look at iText (http://www.lowagie.com/iText/), a library that allows you to generate PDF files on the fly.
There are two steps, if i understood correctly:
1) Creation of PDF template with placeholders to populate data programmatically
2) Populating the PDF template programmatically during run time
For #1, OpenOffice allows creation of PDF templates, which can then be populated programmatically. It's good enough to create simple invoices that doesn't probably involve datagrid/table kind of stuff.
For #2, you already have the answers here - iText, iTextSharp.
Hope this helps!
I love wkhtmltopdf http://code.google.com/p/wkhtmltopdf/
Not sure what your goal is here, but there is an opensource php-library called fpdf, which also has an extension for taking a pre-made pdf as layout and then populate it with more content, generating a new PDF with that info.
However, I would go for a solution that you can integrate nicely into the plattform you're building, but I wouldn't go in a HTML->PDF solution since you won't have any clue about what would fit on a piece of paper regarding sizes in that kind of enviroment, meaning you won't know when you should split the content into two separate templates.
You might also try using XSL:FO. XSL:FO is a documented standard for describing page layout: http://www.w3.org/TR/xsl/#fo-section.
I've had success on two projects creating documents by creating an XML schema that defines the content of the "PDF". I then use the XSD tool (from Microsoft) to generate a class representing this document. I then map my data into that structure, serialize the populated class to XML, along with an XSL stylesheet that defines how that data should be mapped into FO, and pass it to an FO formatter. For formatters, I have use Alt-Soft's Xml2Pdf with success. There are a few others out there. There are some tools available to help create the XSL to FO stylesheet (i.e. stylusstudio and XmlSpy), but I recommend learning the FO constructs as the tools seem to produce bloated stylesheets. FO is comparable to HTML (where a P tag is a BLOCK tag in FO), but can be tricky. This nice thing about FO, is that some formatter support conversion to other formats, such as Word, HTML, etc.
Other options:
iTextSharp (C# port of iText). Just started reading about this. Open source and free. I don't think there is any "templating" supported with this, but I could be wrong about that.
SQL Server Reporting Services. Assuming your invoice data is in, or can be put in, a format that can be read by reporting services (SQL Server, Web Service, etc), define the layout in SSRS and then publish to reporting server. Use SSRS Web Services or query parameter execution to execute the report and have it output as PDF.
This html-2-pdf site may be a helpful starting point: http://maarten.lippmann.us/?p=101
A site a friend of mine built uses a script to churn HTML pages into printable PDFs, too - http://philambdaupsilon.org. Not sure on the exact details of it, but he is an SO user, and I'll send this question to him, too.
Unfortunately, the best system on the market (at present) is passing the HTML & CSS to a ColdFusion server and have that return the rendered PDF. So if money isn't a big concern, this is the quickest to deploy solution that'll render the best results.
I've tried very hard to get FPDF, TCPDF, the R&OS pdf class, and even CodeIgniter's recommendation to work, but nothing with stable output for anything beyond the most basic/bland HTML files.
Honestly, if the ColdFusion solution isn't viable, I'd use html2ps, and then ps2pdf to convert your files into a PDF.
(This is all assuming that you don't want to take the time and design each PDF using the native PDF-creator code in PHP. This is what systems like SugarCRM use. Though its very functional with stable results, the actual creation of each PDF-generator file is a most painful process)
We have used Jasper Reports before. It's not what you'd call user-friendly, but it will talk directly to your database.
html2pdf works very well. You can use this to generate both HTML and PDF reports from the same source.
I'm fiddling with Black Sheep Invoices right now, which is great at first but now I'm having trouble actually getting it to render the PDFs. Lots of installation difficulties--probably a lot easier on your own server but i'm up on a shared host with it. The HTML output and data management portions are well done though, which is something you won't get out of just creating a postscript template. I was hoping to find a reference to a library that has an active development team though (Black Sheep is not being updated at this time).
If you want browser perfect HTML converted to PDF then try commandlineprint
You'll need to install firefox on a linux distro, disable all firefox alerts and then run it through a virtual display. Check this thread for more details.
It's infuriating to get running well but does give you the best results for HTML to PDF conversion I've seen.
OK, a search of Google Code projects turned up Simple Invoices, which is awesome and well maintained.
I use TROFF for my invoices because of its extremely simple textual encoding. The logic is a few lines of Perl. Keeping it simple.
For a Ruby solution, try Prawn: http://prawn.majesticseacreature.com/
I use open office on the server and then generate the XML for the document (just unzip the document and hack away)
Some can use Dhek template editor to define area/placeholder for existing PDF, without altering existing document, and then populate it to generate final doc (e.g. with user values from a form): https://github.com/applicius/dhek .

Resources