Why is the Norwegian speech-to-text using "særskriving" and not correct grammar? - speech-to-text

When I say compound words to the service, it will almost instinctively add spaces between words that should not be there, as if it was the rule itself. An example is the Norwegian word for console window, which is "konsollvindu" written correctly, but the speech-to-text service writes it as the incorrect "konsoll vindu" instead.
I have tried numerous places to get in contact with someone at Microsoft to fix this, as I am a Norwegian, and a linguist by degree. I can help fix these issues if you don't have anyone internally on this.
This is extremely important to fix. So please, fix this, or don't include Norwegian publicly for now.

According to mr. Lin over at Cognitive Uservoice this issue will be addressed within 3-4 weeks. Thank you to all who helped! :3

Related

How can I make text appear on scrolling like the following website

Can someone help me with the following problem. I want to make text appear and flow to the right at the same time like this website: https://www.wokine.com/en/ . How should my code look like?
There are a series of solutions to that and writing about them all in detail would take forever, I'm just going to share a few of my favourite libraries and a few resources on them.
AOS is in my opinion the nicest library doing what you want to do. It's on Github.
WOW.js is another library that does what you are looking for, but it has some strange licensing.
With a queck search I could also find ScrollMagic, I have no experience with that one.
Hope it helps get you started!

Is there a publish add-on to translate google slides?

I am a history teacher and have been working to assist students learning English with class materials and work along with many of my co-workers. Although it has its issues, google translate is incredibly helpful. I had been looking for ways to translate slides efficiently when I found the "Quickstart: Translate add-on for Google Slides" page of G Suite Developers. Link Below.
I know a bit about code and was able to quickly follow the instructions (which were excellent) to set up the add on, but many of my co-workers may struggle with adding it to their slides.
Long story short here is the question:
I am wondering if there is an already published version of the page/Quickstart Translate add-on which people can use instead of setting up the add on manually? I have not been able to find one.
I'm running into the same problem and have found the same document as you. Have you thought about publishing the add-on to your domain only. You can do that and the link you have links to the documentation on how to do so. You may need to work with the Google Admin in your district to make it happen though. Good luck!

How to automatically detect code snippet from a text sample?

I'm doing some analysis on GitHub comments. But for that, I need to exclude the code samples and error messages from the comments automatically from a large set.
The other easier way to say this would be, I can keep only the English part of the comments. Although there are few libraries to detect the language of a sentence, there are few challenges in my case too. 1) the comment part does not always follow proper English grammar, 2) the code sample and error message mainly consist of English words too.
So what should be my best approach. The results don't need to be 100% accurate, I just want to know the best approach that can give me a satisfactory result at least. Any idea?
This question is old, but my Google search led me to this question; so offering this answer in case anyone stumbles into this question, too.

Semantics based code search

We have a large number of repositories. We want to implement a semantics(functionality) based code search on those repositories. Right now, we already have implemented keyword based code search in which we crawled through all the repository files and indexed them using elasticsearch. But that doesn't solve our problem as some of the repositories are poorly commented and documented, thus searching for specific codes/libraries become difficult.
So my question is: Is there any opensource libraries or any previous work done in this field which could help us index the semantics of the repository files, so that searching the code becomes easy and this would also help us in re-usability of the codes. I have found some research papers like Semantic code browsing, Semantics-based code search etc. but were of no use as there was no actual implementation given. So can you please suggest some good libraries or projects which could help me in achieving the same.
P.S:-Moreover, companies like Koders, Google, cocycles.com etc. started their code search based on functionality. But most of them have shut down their operations without giving any proper feedback, can anyone please tell me what kind of difficulties they are facing.
not sure if this is what you're looking for, but I wrote https://github.com/google/zoekt , which uses ctags-based understanding of code to improve ranking.
Take a look at insight.io
It provides semantic search and browsing

Search for VMS documentation tool (VAX document ?)

Many years ago I worked at a DEC-shop. We used a tool called Document (as far as I remember) to create documentation. It was provided by DEC and created the same layout as the original DEC documentation. Which is as far as I'm concerned a milestone in layout and typesetting.
Researching the web I found a more or less obscure company which sells this tool for Open VMS. But I would prefer an open source replacement.
Any help ?
Greetings Till
Touch Technology was, and perhaps still is, an interesting company with interesting folks like 'Mr Dan'.
They picked up a good bit of Digital software in a fire-sale and had some good stuff such themselves such as performance tuning tools and a 4GL (Intouch... available on OpenVMS Freeware).
The company appears to have moved one, judging by their current website front door which does not dwell on the old stuff , but you could do worse than try contact them.
The back door still list DECdocument: http://www.ttinet.com/documentation.html
Good luck!
Hein
If you're still looking for a solution, have you thought about LaTeX? The markup syntax isn't radically different from VAX DOCUMENT's SDML. They both have the same back-end; the final steps in processing an SDML file involved running it through TeX.
I would think the best solution would be DocBook, since it is also an SGML-ish format. You might be able to translate a substantial portion using XSS.

Resources