I have a Text file on my Desktop AltText.txt, that contains filenames extracted from InDesign.
I have an Excel file that contains the Alt text descriptions, I typically manuel copy paste from Excel to the Text file.
I trying to figure out if there is more efficient away to get Find filenames in the Excel and extract them to the Text file?
Example of Text File:
Forward Collision Alert-4C.ai
Graphic Number : 1
Lane Keep Assist-4C.ai
Graphic Number : 2
Lane Change Alert-4C.ai
Graphic Number : 3
Example of Excel File:
Column A Column B
Forward Collision Alert-4C.ai Icon of Forward Collision Alert
Thank you,
Sergio
Paste the text in a new excel file. > put this formula in sheet2,A1.
=INDIRECT("Sheet1!A"&1+(ROW()-1)*3,TRUE)
and drag downwards
Hope it helps. ( :
I have been able to extract text from multiple pdf files but the original files had double line spacing within 1 column of text which was wrapped or justified so my extracted text also has alot of CR LF within. My issue is when the text wraps sentences also contain CR LF
toy example
This sentence continues
on this line. Next ....
I don't want to loose all the spacing structure like paragraphs so is there a way to unwrap (un-justify) text using python without removing all spacing in the document or intelligently join the lines back?
Eventually I want to spell check the text with Spacy after additional text processing to handle non english text but the justified/wrapped text may be causing misspellings and difficultly detecting all the non english text.
My Excel has a block with a comma , eg. abcd,xyz.
When I convert this Excel to CSV, this text gets converted as 2 columns.
Can anyone advise how I can prevent this?
Excel by default will use a comma as the delimiter, but you can change that:
Go to Control Panel > "Region and Language" (Or just "Region" in Windows 10), and then click the "Additional settings" button on the bottom.
Now look very closely at the List separator item, which normally
has a comma in the field, and for the purposes of this illustration,
change it to a Pipe | character.
Once you hit Apply, and then save your Excel file as a CSV file, you’ll notice that your file now has pipe | characters as the delimiter.
PS: You'll probably want to change the List separator back to a comma just in case some other application needs it.
The Excel file I am using has cells which contains multiple lines typed in by using Alt+Enter. When this Excel is load into teradata, these multiple lines in single cell should reflect in single cell in the teradata table also. But instead, it is getting loaded as several rows. How to rectify this?
Go to Find & Replace (aka Ctrl+H). Type Ctrl+J in the Find what: text box and put a space or some alternate character in the Replace with: text box. Make sure that the Match entire cell contents is not active and click Replace All.
This effectively changes all instances of line feeds (aka CHAR(10), vbLF or ASCII 0010) into spaces. If spaces will not suit your purposes, choose another character or text string.
I have some data of form
[39645961,-79966658]358920045121212[0.75]2013-01-30 20:47:52
[39646124,-79966771]358920045121212[0.5]2013-01-30 20:47:54
[39646134,-79966733]358920045121212[0.5]2013-01-30 20:47:56
[39646123,-79966723]358920045121212[0.5]2013-01-30 20:47:58
[39646144,-79966724]358920045121212[0.5]2013-01-30 20:48:09
......
How can I import them into an excel file into separate columns. like
39645961 -79966658 358920045121212 0.75 2013-01-30 20:47:52
39646124 -79966771 358920045121212 0.5 2013-01-30 20:47:54
39646134 -79966733 358920045121212 0.5 2013-01-30 20:47:5
Any ideas?
If it's not too frequent task:
Copy-paste the text to Excel (will occupy one column)
Data - Text to Columns (Excel 2003)
Delimiters: Comma and Other: ]
After completing the operations, insert a column after the remaining non-splitted fragment (358920045121212[0.75) and repeat Text to Columns for this column only with Other delimiter as [.
1) Copy the data into a text file, like Notepad.
2) Use find and replace to replace bracket characters with a tab character.
You can not directly type a tab character into the replace field, because it will just move your cursor to the next field. To get around this:
Open another Notepad window and press tab, then copy the tab into the replace field of the original Notepad window. Hit replace and repeat this process with space and comma characters.
3) Save and close the notepad file.
4) Open the notepad file in Excel. (choose file, open, and don't forget to change the file type in the open dialog from "All Excel Files" to "All Files"
5) This will open the Text Import Wizard. Hit next, next and finished, and the data should show up in separate columns
If you want to do it strictly in Excel, you will have to extract the individual data elements from each string using a combination of text functions, including SEARCH or FIND, LEFT, MID and RIGHT. The following formulas show one wqy to extract each element from one of the strings, which I have assumed is in A1.
=MID(A1,2,SEARCH(",",A1)-2)
=MID(A1,SEARCH(",",A1)+1,SEARCH("]",A1)-SEARCH(",",A1)-1)
=MID(A1,SEARCH("]",A1)+1,SEARCH("]",A1)+SEARCH("[",MID(A1,SEARCH("]",A1),99))-SEARCH("]",A1)-2)
=MID(A1,SEARCH("[",A1,2)+1,SEARCH("]",MID(A1,SEARCH("[",A1,2)+1,99))-1)
=MID(A1,SEARCH("????-??-??",A1),10)
=RIGHT(A1,8)
You would enter these formulas horizontally to the right of A1, then copy them down.
There is a much simpler way - use a third party piece of software.
The one I used costs me very little for the year, but means i don't need to mess around with trying to get it right.
Its the only tool i found which isn't a monthly subscription as well.
Its a desktop based application.
https://onpage.rocks/product/server-log-tool/