Get translation of words directly in cells - excel

I have a column with (for example) 30 words in French and I want to translate them to English. Each word is in a cell. I usually look them up in dict.cc and choose the best one but the copy/pasting is annoying.
Is it possible to do the following? When I move over a cell (French word), a small box should pop up with all available translations from dict.cc so that I can choose one.

You can download the dict.cc database (FR->EN, or any other language pair for that matter) from their page. You would then have to integrate the data into your file.
Currently the FR->EN file is approximately 667,410KB saved as a text file. Pasting it into Excel gives you a three-column sheet (column 1: FR, column 2: EN, column 3: special conditions).
With Data Validation or a lookup formula you can then bind the words to their translations.
http://www.dict.cc/?s=about%3Awordlist
See section 13, "Vokabeldatenbank" (vocabulary database), on that page.
Hope this helps you.
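If the word list is easier to prepare outside Excel first, the lookup idea can be sketched in Python. This is only a minimal sketch under assumptions that are not confirmed by the answer above: the export is assumed to be tab-separated with the French term in the first column and the English term in the second, and comment lines are assumed to start with `#`. The function name `load_wordlist` and the sample rows are hypothetical.

```python
import csv
import io
from collections import defaultdict


def load_wordlist(lines):
    """Build {french: [english, ...]} from tab-separated word-list lines."""
    translations = defaultdict(list)
    for row in csv.reader(lines, delimiter="\t", quoting=csv.QUOTE_NONE):
        # Skip blank lines and comment lines (assumed to start with '#').
        if len(row) < 2 or row[0].startswith("#"):
            continue
        french, english = row[0].strip(), row[1].strip()
        # Keep every distinct translation so one can be chosen later.
        if english not in translations[french]:
            translations[french].append(english)
    return dict(translations)


# Hypothetical sample rows; a real export would be read with
# open(path, encoding="utf-8", newline="") instead of io.StringIO.
sample = io.StringIO(
    "# FR-EN vocabulary database\n"
    "chat\tcat\tnoun\n"
    "chat\tchat\tnoun\n"
    "maison\thouse\tnoun\n"
)
wordlist = load_wordlist(sample)
print(wordlist["chat"])
```

The resulting candidates per word could then be written to a helper sheet and offered through a Data Validation list, as suggested above.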

Related

Ideas to extract specific invoice pdf data for different formats and convert to Excel

I am currently working on a digitalisation project which consists in extracting specific information from pdf-formatted electricity invoices. Once the data is extracted, I would like to store it in an Excel spreadsheet.
The data to be extracted is shown in the following image:
https://i.stack.imgur.com/6RLo2.png
In this case, the data to be extracted is the information outlined in red: the CUPS, the total amount and the electricity consumed per period (P1-P6).
Once this is extracted, I would like to display this in an Excel Spreadsheet.
Could you please give me any ideas/tips regarding the extraction of this data? I understand that OCR software would do this best, but do not know how I could extract this specific information.
Thanks for your help and advice.
If there is no text data in your PDF then I don't believe there is a clean and consistent way to do this yet. If your invoice templates are always the same format and resolution, then the pixel coordinates of the text positions should be the same.
This means that you can create a cropped image with only the text you're interested in. Then you can use your OCR tool to extract all the text and you have extracted your data field. You would have to do this for all the data fields that you want to extract.
This would only work for invoices that always have the same format and resolution. So scanned invoices wouldn't work, and dynamic tables make things exponentially more complex as well.
I would first check whether it's possible to simply extract the text using PDF-to-text, then build my cmd text parsing around that output and loop from file to file.
I don't have your sample to test, so you would need to adjust this to suit your bills:
pdftotext -nopgbrk -layout electric.pdf - |findstr /i "cups factura" & pdftotext -nopgbrk -layout -y 200 -W 300 -H 200 electric.pdf
Personally, I would handle the two parts as separate passes. For the first pair, replace each , with a safe CSV character such as *, then inject a , at the large gap to make a two-column CSV (perhaps replace the Γé¼, which is how the console displays the € sign, with ,€ if necessary, since your captured text may already be in euros).
For the second group, I would possibly inject a , by numeric position to form the desired columns. I only demo 4 columns by 2 rows, but you want 7 columns by 4 rows, so adjust those values to suit. However, you can use any language you are familiar with, such as VBA, to split the text however you want before importing it into Excel.
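The column-splitting step can also be sketched in Python rather than cmd. This is a rough sketch, not the method used above: it splits each line wherever two or more spaces occur (the "large gap") and relies on the `csv` module's quoting instead of swapping commas for `*`. The sample text and the name `layout_to_rows` are hypothetical, and real `pdftotext -layout` output will need its own gap threshold.

```python
import csv
import io
import re


def layout_to_rows(text, min_gap=2):
    """Split each non-empty line into columns at runs of min_gap+ spaces."""
    gap = re.compile(r" {%d,}" % min_gap)
    return [gap.split(line.strip()) for line in text.splitlines() if line.strip()]


# Hypothetical layout-preserving text; real invoices will differ.
sample = (
    "CUPS            ES0000000000000000AA\n"
    "Total factura   1.234,56 €\n"
)
rows = layout_to_rows(sample)

# csv.writer quotes any field that contains a comma, so decimal commas
# in amounts do not break the column structure.
buffer = io.StringIO()
csv.writer(buffer).writerows(rows)
print(buffer.getvalue())
```

Writing `buffer.getvalue()` to a `.csv` file gives something Excel can open directly; for position-based columns, slicing each line at fixed character offsets would replace the regex split.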
In Excel you may want to use PowerQuery to read the pdf:
https://learn.microsoft.com/en-us/power-query/connectors/pdf
Then you can further process to extract the data you want within PowerQuery.
If you are interested in further data analysis after extraction you may want to consider KNIME as well:
https://hub.knime.com/jyotendra/spaces/Public/latest/Reading%20PDF%20and%20extracting%20information~pNh3GdorF0Z9WGm8
From there export to Excel is also supported.
edit:
After extraction, regex helps to filter for the specific data, e.g. by looking for keywords, the length and structure of the data item (e.g. the CUPS number), whether a value is a currency amount with decimals, etc.
edit 2: regex in Excel
How to use Regular Expressions (Regex) in Microsoft Excel both in-cell and loops
E.g. look for a new line starting with CUPS followed by a sequence of 15 characters (if you have more details, you can make the matching pattern more specific: e.g. starting with E, or the 5th character being X or 5, etc.)
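As an illustration of that kind of pattern, here is a minimal Python sketch. The exact shape of the CUPS line is an assumption (the keyword, a colon or whitespace, then 15 uppercase letters or digits, taken from the example above), as is the amount format with `.` for thousands and `,` for decimals. The sample text and pattern names are hypothetical.

```python
import re

# Assumed shape: a line starting with "CUPS", then a colon and/or
# whitespace, then exactly 15 uppercase letters or digits.
CUPS_PATTERN = re.compile(r"^CUPS[:\s]+([A-Z0-9]{15})\b", re.MULTILINE)

# Assumed amount format: optional thousands groups with '.', decimals after ','.
AMOUNT_PATTERN = re.compile(r"\b\d{1,3}(?:\.\d{3})*,\d{2}\b")

# Hypothetical extracted text.
sample = "Factura 2023\nCUPS: E123X5678901234\nTotal 1.234,56 €\n"

cups_match = CUPS_PATTERN.search(sample)
cups = cups_match.group(1) if cups_match else None
amounts = AMOUNT_PATTERN.findall(sample)
print(cups, amounts)
```

Tightening the pattern, for example requiring a leading `E`, only means replacing the first `[A-Z0-9]` position with a literal character.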

Finding Unique Characters in Spreadsheet

I have created a spreadsheet that shows me what the text is in each language.
I need to create another cell that picks out any special characters (letters with added marks, such as accents or umlauts) and lists them in one cell for each language.
Is there a way of doing this without scanning through myself and putting them in?
For example:
English - "COUNTER","CHECK"
German - "ZÄHLER", "PRÜFEN"
Special Character - "Ä" , "Ü"
The file is very large and screenshots will be hard to take. I have attempted picking out the individual characters, but this is very time consuming. I have not yet been able to find anything on this subject.
When it comes to Excel, I'm not amazing with all the formulas; I can only do a few things.

Excel order writing Arabic text and English Numbers

Problem:
I have a sentence/word I need to paste into an Excel cell. It consists of an Arabic word along with some numerical values (in English). For example, let's say I want to write the below
However, Excel keeps returning the below
2,1 - الحدث
If you notice, the numerical values are always moved to the left of the word, while I want to keep them on the right.
I tried concatenating, but that didn't seem to help. I am open to any solution, VBA or not, as I have spent a lot of time on this with no luck.
I added this keyboard to my device and it helped me with that.
Central Kurdish Keyboard
Use it to write the numbers only, and use the normal Arabic keyboard to write the rest of the text.
Note that it didn't work with the numpad; it worked only with the number row at the top, the one that has the special characters !@#$%^&*()

Excel, Numberplate Clarification

I am working on an Excel document for fuel cards at the moment, and my current issue is writing a formula to validate number plates against the UK standard format (two letters followed by two numbers, then three letters, e.g. BK08JWZ). For now we are not considering personal plates, just to keep things simple.
Ideally I need Excel to look at the text in the cell and check it against an agreed layout, but I am struggling to find the right formula. The plates are in column 'I', and I have already added another column after it, titled 'approved plates', in column 'J', but this can be deleted if it's not needed.
Results-wise, I can do this one of two ways: either get the Excel document to highlight any number plates that do not match the DVLA standard, or have a column next to the number plate column that registers a Boolean response, i.e. true if the plate is valid and false if not.
Either way, the plate needs to remain visible as entered, so if there is something wrong with it, it should still be shown rather than replaced by an error message.
Any help would be very welcome.
All the information on UK standard number plates is on this site:
https://www.gov.uk/government/uploads/system/uploads/attachment_data/file/359317/INF104_160914.pdf
I would do it like this:
1) Create a lookup sheet with data from the booklet: one column for the allowed "memory tag" identifiers (first two letters), one column for the allowed "age identifiers" (the two numbers), and one column for the allowed random letters (last three letters, full alphabet except I and Q).
2) Strip spaces from the number plate for comparison.
3) Use MID(numberplate,1,2), MID(numberplate,3,2) and MID(numberplate,5,3) to compare against each lookup list respectively (using INDEX()>0).
4) When all 3 parts are found in the lookup lists, the number plate is valid.
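The four steps above can be sketched outside Excel as follows. This is only an illustration in Python: the lookup sets are hypothetical and deliberately incomplete (the real lists would come from the DVLA booklet), the function name `is_valid_plate` is made up, and upper-casing the input is an extra assumption not stated in the steps.

```python
import string

# Last three letters: full alphabet except I and Q, as in step 1.
RANDOM_LETTERS = set(string.ascii_uppercase) - {"I", "Q"}


def is_valid_plate(plate, memory_tags, age_identifiers):
    """Return True when all three parts of the plate pass their lookup."""
    # Step 2: strip spaces (upper-casing is an added assumption).
    cleaned = plate.replace(" ", "").upper()
    if len(cleaned) != 7:
        return False
    # Step 3: same split as MID(p,1,2), MID(p,3,2) and MID(p,5,3).
    tag, age, letters = cleaned[0:2], cleaned[2:4], cleaned[4:7]
    # Step 4: valid only when every part is found.
    return (
        tag in memory_tags
        and age in age_identifiers
        and all(ch in RANDOM_LETTERS for ch in letters)
    )


# Hypothetical, incomplete lookup lists for illustration only.
memory_tags = {"BK", "LA"}
age_identifiers = {"08", "58"}

print(is_valid_plate("BK08 JWZ", memory_tags, age_identifiers))
print(is_valid_plate("BK08JQZ", memory_tags, age_identifiers))
```

The function returns a Boolean rather than altering the plate, which matches the requirement that the original entry stays visible next to its validation result.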
Try researching Regular Expressions, or RegEx. This is a powerful programming tool for determining whether strings match specific patterns. You can use RegEx expressions to extract the pattern, replace the pattern or test for the pattern. It is very efficient, though not for the faint-hearted, and there is plenty of help online. Try this article for starters.
The following RegEx may be what you need:
(^[A-Z]{2}[0-9]{2}[A-Z]{3}$)|(^[A-Z][0-9]{1,3}[A-Z]{3}$)|(^[A-Z]{3}[0-9]{1,3}[A-Z]$)|(^[0-9]{1,4}[A-Z]{1,2}$)|(^[0-9]{1,3}[A-Z]{1,3}$)|(^[A-Z]{1,2}[0-9]{1,4}$)|(^[A-Z]{1,3}[0-9]{1,3}$)
This was copied from this article which gives a very full explanation using DVLA rules.
EDIT:
To use RegEx within Excel: in the VBA IDE, open the Tools menu, select References and add the Microsoft VBScript Regular Expressions 5.5 reference.
With acknowledgement to user3616725's helpful observation.
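To see how such a pattern behaves before wiring it into VBA, it can be tried in Python first. This sketch checks only the current format described in the question (two letters, two digits, three letters); the function name is hypothetical, and stripping spaces beforehand is an assumption carried over from the other answer.

```python
import re

# Current-format plates only: two letters, two digits, three letters.
CURRENT_FORMAT = re.compile(r"^[A-Z]{2}[0-9]{2}[A-Z]{3}$")


def matches_current_format(plate):
    """Return True if the plate, with spaces removed, fits the pattern."""
    # fullmatch avoids accepting a trailing newline after the plate.
    return CURRENT_FORMAT.fullmatch(plate.replace(" ", "")) is not None


for candidate in ["BK08JWZ", "BK08 JWZ", "BK8JWZ", "bk08jwz"]:
    print(candidate, matches_current_format(candidate))
```

A pattern check like this confirms only the layout; it does not verify that the memory tag or age identifier was ever issued, which is what the lookup-list approach adds.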

Excel - replace words automatically

I don't know if what I'm asking can be done, but I hope so.
I follow races, and after each session there is a table showing the standings of the drivers and teams. For a weekend, there are 5 tables.
For each session, I copy the table, paste it into Excel and import it on my website to display it as a table. But before I do that, I have to rename the drivers' and teams' names (always the same in all tables) to another language and then import the file.
This process takes a few minutes, about 5. With 5 tables at 5 minutes each, that is 25 minutes per weekend. So I am wondering if there is a trick in some software or in Excel that could replace all the words I want with other words automatically. That could save me a lot of time.
Then the next time I paste the table into Excel, I would see the drivers' names automatically in my language, and I could import the table without editing it and wasting a few minutes.
I really appreciate your help and hope I can have a positive answer from this community.
Thanks!
I fear there may be more required than relatively simple lookup formulae but for want of any other answer so far and focussing on your title, please try:
=VLOOKUP(B2,drivers,2,0)
in E2 for the example below, and copy down to suit. This assumes you have an array, as in the box in the middle, named drivers (it need not be in the same sheet) and that column D is a copy of column A.
Similarly in G2:
=VLOOKUP(C2,team,2,0)
where the right-hand box is an array named team, and column C is the team column after relocation.
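The same lookup idea can be sketched outside Excel, for instance if the table is cleaned up by a script before import. This is a minimal Python sketch: the two dictionaries play the role of the named arrays drivers and team, and all names and rows in it are hypothetical. Unlike VLOOKUP with an exact match, which returns #N/A for a missing key, this version keeps unknown names unchanged.

```python
def translate_rows(rows, drivers, teams):
    """Return rows with driver and team names replaced via the lookups."""
    return [
        # dict.get(key, key) falls back to the original name when no
        # translation is listed.
        [position, drivers.get(driver, driver), teams.get(team, team)]
        for position, driver, team in rows
    ]


# Hypothetical lookup tables (original name -> translated name).
drivers = {"Driver One": "Pilote Un", "Driver Two": "Pilote Deux"}
teams = {"Red Team": "Équipe Rouge"}

# Hypothetical pasted standings: position, driver, team.
standings = [
    [1, "Driver One", "Red Team"],
    [2, "Driver Two", "Blue Team"],
]

translated = translate_rows(standings, drivers, teams)
for row in translated:
    print(row)
```

Because the lookup tables are defined once, each new session table only needs to be passed through `translate_rows`, which mirrors copying the VLOOKUP formulas down a freshly pasted sheet.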
