to my project i need document editor for many types of documents(tabular data, invoices, letters, some formulars,...) and i am looking for text format and editor to acompish my task
is there some MS Word like format?
I know for example rtf, I need formating, invissible comment will be veeery good
open format
same 3th p. editor will be good
I found some solutions, rtf is the most frequent
please have you same suggestions or personal experiences?
Check ODF (Open Document Format) for a free and open document format that can handle text, spreadsheets and many other things.
See Wikipedia for references.
OpenOffice is one of the many products that do support ODF.
Are you looking for a text format or a text editor ? I guess this is a text format...
Many exist (other than RTF that you already mentionned) such as:
HTML
TeX/LaTeX
nroff/troff
Postscript
Many (more or less) wysiwyg editors are available for the first two.
Related
I want a excel macro which search words in PDF and give the page number where macro finds the words. I have 20 words that I want to search in PDF. I have put the keywords in coulmn A of the excel spreadsheet and I want to populate the page number in coulmn b. Please note that I am currently using Adobe reader XI, so please help me with the code which also work in Adobe reader XI.
This is more of a direction and not an answer.
Try searching for command line tools that will export ocr data into a text file. I've looked at them before and a few gave me the option of looking at the particular page of a pdf. All of these tools require a purchase (I was trying to OCR a barcode and I could not find a free tool for this) but there are some free ones out there.
But using excel will make this project harder. I would look at using powershell or some other scripting language and exporting the results into a csv file.
Hope this helps.
I have been searching for the logic to convert the SpreadSheet data of an excel into XML format.
I have the thousands of data like shown below. i want to convert this into XML format.
Anyone please help me, any help would be appreciated greatly.
According to the screenshot, your Excel version allows to save your document using XML Spreadsheet 2003 format: Save As... > Other Formats and locate it in the dropdown (at least for Excel 2007 it works as described).
Resulting XML will contain much of the native Excel Workbook fields and nodes, but they're might be easily removed using any more or less advanced XML editor, e.g. Altova XMLSpy or any similar. However, cleanup depends entirely on your further needs.
For your convenience may see sample Excel book and XML generated from it as described above: https://www.dropbox.com/s/kxmxu2tq52y4m9b/ExcelToXML.zip
Good luck!
I need to convert a pdf form that contains a column of handwritten numbers to text and populate an excel spreadsheet.
Does anyone know of a program or a solution to solve this problem?
Thanks in advance.
Edit:
I have tried programs like pdfcompressor, but its returning me random symbols. Im assuming numbers should be easier to convert than random letters.
If you have a version of Microsoft Office from XP to 2007, you can use Microsoft Office Document Imaging. It is a PDF viewer-like program. Once you open your image file, you can use your mouse icon to crop and highlight sections of the image. You can then copy and paste the highlighted section into Excel using the built-in OCR software.
You'd need an OCR program (google OCR) to interpret the handwritten text/numbers. But that would then only give you a raw text or .doc file, not an excel sheet. You'd need to manually move the numbers across - might still be better than keying them in, if you're looking at a very large list.
Abbyy Finereader would be the first place to start. It has support for machine printed and hand printed OCR and comes with a nice GUI interface. You should be able to download a trial version from www.abbyy.com. It will be able to export to all sorts of formats. If you need an SDK then Kadmos from www.rerecognition.com supports hand and machine print OCR.
I have a form that has TinyMCE for richtext formatting. All of our data is available to export as an HTML report, PDF Report, and Excel Spreadsheet (report).
The fields, that we allow richtext in, show up as the formatted values in both the HTML and PDF reports, but in Excel we show them as strings. For instance:
<b>this part is bold</b><br />line 2 here.
I need a way to make that show up as bold/line-break in excel rather then just showing that string, or at least a way to strip the HTML tags out of there and just show plain text (though I would really like to at least keep the line breaks). Is there some type of macro I can include in the excel download or some C++ program that can convert it or something?
Thanks for your time!
I've done something similar with PHPExcel
The trick is to take your formatted data and find a pattern. In your case, it would probably be table rows/table cells. Iterate through that structure setting the excel cell values as you go. For complex formatting you could fairly simply regex replace what is necessary to get formatted as you desire. The theory may sound a little complicated, but once you get down to it, it's only an hour or two's worth of work.
Certainly there are equivalent programs based on other server technologies. But this one has worked brilliantly for me over the years, and I trust it to work on sites for very big clients with crazy inbound traffic numbers...and it's never failed. It's the only reliable way I've found to write perfect, properly formatted Excel without requiring the user to jump through hoops to get a specific browser.
Closed. This question needs to be more focused. It is not currently accepting answers.
Want to improve this question? Update the question so it focuses on one problem only by editing this post.
Closed last month.
Improve this question
Is it possible to convert Excel spreadsheets into MediaWiki markup? I stumpled upon strange recommendations exporting to HTML and convert it into markup. Is there a solution, maybe exporting to XML first or directly converting it?
I know of three options:
Install a WYSIWYG editor extension like FCKeditor (see also Official). Advantage: fairly easy Paste As Word (and therefore also as Excel) button. Disadvantage: installation can be tricky.
Use a macro in Excel. Advantage: a one-click creation of markup. Disadvantage: client-side solution (so need it for all users).
My preference is the FCKeditor option because once it is installed it works pretty well.
You can do this in a graphical interface, with proper software.
Install LibreOffice and add the "wiki publisher" extension, often contained in a package named libreoffice-wiki-publisher.
Open your spreadsheet with LibreOffice Calc, copy your table.
Open a new Writer document, paste with Edit > Paste special (RTF style).
Find "Export" in the menu, select "MediaWiki (txt)" in the format dropdown and confirm.
(Optional.) In your preferred text editor, remove any excess table HTML markup with a couple of simple text replacements and replace the first line {| with {| class="wikitable" to have pretty backgrounds and borders.
LibreOffice is free/open source software and is better than Excel at handling Excel's own spreadsheets, as you see. There used to be an Office plugin too but I've not heard of anyone using it recently.
You may need table styles, available in LibreOffice 5+; but in the meanwhile you can just apply CSS classes to your MediaWiki table.
Alternatively, just copy and paste your table in a page powered by VisualEditor, which is quite good for tables. If your wiki doens't have it, you could still use the MediaWiki.org sandbox: paste your rich text, click the pencil button at the top and then "wikitext/source editing", cut the wikitext and paste into your wiki.
My port of Shan Carter's Mr. Data Converter now supports the Wiki table format. You can copy & paste directly from Excel or from a CSV file.
http://thdoan.github.io/mr-data-converter/
Here is a simple python script that I threw together for my needs. This doesn't handle cell formatting or anything of that nature, but if you just need to get a large table into the MediaWiki format, it'll do the trick. It depends on xlrd.
Usage of this script is as simple as
python xl2wiki.py input.xls
If you want to save the output to another file, just do
python xl2wiki.py input.xls output.txt
MediaWiki supports HTML syntax for tables. The wikitext doesn't look nice and is harder to edit, but if you are just going to copy&paste anyway, it works. And there should be plenty of tools for converting from Excel (or CSV, ODS) to HTML.
Damn, I should find the time to add native CSV supprot to MediaWiki.
I use this macro available on the main Wikipedia site.
It convert the Excel tables to wiki formating. The output is pretty close to the original excel file. it does transfer font formating and cell colors. There is limitation on the borders but they come from mediaWiki system.
You can find the Code at :
https://de.wikipedia.org/wiki/Wikipedia:Technik/Text/Basic/EXCEL-2003_Tabellenumwandlung_VBA
It's 2021 now.
You can copy basic Excel spreadsheets directly into MediaWiki's Visual Editor.
The only thing that's missing in the copy/paste method is cell formatting.