Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 4 years ago.
Improve this question
I want to determine which part of audio file contain speech or music.
I hope someone has a made something like this or can tell me where to start.
Can you please suggest some method/tutorial for doing the same.
Thank you.
Check out the pyAudioAnalysis python library. Among others, it has a pre-trained speech-music classifier and two segmentation-classification methods (one based on fix-sized windows and another based on HMMs).
You can extract speech and music parts of an audio recording quite easily, e.g.:
from pyAudioAnalysis import audioSegmentation as aS
[flagsInd, classesAll, acc] = aS.mtFileClassification("data/scottish.wav", "data/svmSM", "svm", True, 'data/scottish.segments')
with a result as the one in this image
There's lots of prior art in this area, but I'd suggest browsing through some of Dan Ellis's papers. The slides for this talk has some good background. In short it's all down to picking the right feature vectors.
Related
Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 2 years ago.
Improve this question
Hello i haven't been using EAGLE for a while now and had mostly forgot where to get any good and complete library of basic parts like resistors, LEDs, transistors... I have tried to find a library i need on EAGLE web page, but i haven't found any, that would offer quite large amount of basic parts.
If anyone could point me to a library with a good and large set of basic parts he would really save my day.
The Sparkfun Eagle libraries are quite good. Download at https://github.com/sparkfun/SparkFun-Eagle-Libraries
dear you can use "ORCAD" software rather than using EAGLE as it is easy in use and easy availability of its libraries on net.
Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 4 years ago.
Improve this question
I am doing a project in news classification. Basically the system will classifying news articles based on the pre-defined topic (e.g. sports, politic, international). To build the system, I need free data sets for training the system.
So far, after few hours googling and links from here the only suitable data sets I could find is this. While this will hopefully enough, I think I will try to find more.
Note that the data sets I want:
Contains full news articles, not just title
Is in English
In .txt format,not XML or db
Can anybody help me?
Have you tried to use Reuters21578? It is the most common dataset for text classification. It is formated in SGML, but it is quite simple to parse and transform to a txt format.
You can build it, you can write a Python/Perl/PHP script where you run a search, then when you find the answers you can isolate the attributes with regex... I think is the best option. Is not easy but should be fun, finally you can share this dataset with us.
Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 4 years ago.
Improve this question
Is there any open-source code can extract content-based music features like tempo, beats, etc.?
My application scenario is like this:
A music repository which contains more than 20,000 songs
Extract features and auto-tag these songs by the "Program" which could be suggested by you.
Auto classify these songs by the tags.
The best open-source audio (waveform, etc.) tool for what you're looking for is the jAudio subset of jMIR: http://jmir.sourceforge.net/jAudio.html. The best tool that you could use for the project, though it's not open-source is free to use, is the EchoNest API. I am assuming that when you say "music" you mean audio; if it's musical scores, you could use the jSymbolic toolkit of jMIR if the data is MIDI or the .features module of my music21 toolkit if it's another format.
Echonest API is the best. I have used it a lot in my projects. I used this tutorial for learning how to use echonest http://preciselyconcise.com/apis_and_installations/echo_nest.php
Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 3 years ago.
Improve this question
I would like to ask you to recommend me a C/C++ library to capture video.
I am getting frames from a camera at 16 bits per pixel (RGB), and I would like to save those frames in a video format in an efficient way to be able to look at it in the future.
Would you please recommend me a good C/C++ library that I can use under Linux please?
A Google search gives me a lot of libraries, and I am really not sure which one does a good job and is widely used. I would greatly appreciate your help.
Thank you very much.
As far as I know -- the predominat library is Video4Linux -- however I have not tried it myself, but the list of applications using/supporting it is impressive.
Addition:
For Multi media Encoding GStreamer is probably one of the most used frameworks.
Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 6 years ago.
Improve this question
I've noticed that the wiki transcriptions for some of the recent Stack Overflow Podcasts are kind of weak. Clearly, this task calls for a computer program. Is transcribing audio to text (ideally with speaker labels so we know who said what) something that could feasibly be accomplished in software? Are there any active open-source software projects attempting to implement such functionality?
Believe me, I have searched for this before. There are slim to none text to speech that are open source or free to use. From my search there weren't any free speech to text synthesizers. These things are so hard to code and expensive that they can't really be made with an open source approach. If you really need this you would have to purchase it from a company. (although I don't know any off the top of my head).
I've looked into this a little. I tried the Microsoft Speech API but got very poor results. I've been wanting to look into the CMU Sphinx project, especially the Transcriber demo.