I am using Wrangler to read an Excel file and transform it.
The issue is that Wrangler only gives an option for the sheet number or name; I also need to specify the range of cells to read, e.g. 'B1:E450'. I could not get any combination of column declarations to work. Any help is appreciated.
You could use the Excel source plugin in the Pipeline Studio, which would let you specify the columns and a row cap to import from the file. To use Wrangler after that, however, you'd have to manually enter the directives using the Properties view in the Studio.
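For illustration only, here is a minimal Python sketch of how a range such as 'B1:E450' breaks down into the two things the answer mentions: a set of columns and a row cap. The function names `parse_a1_range` and `column_index` are hypothetical helpers, not part of Wrangler or the Excel source plugin.

```python
import re


def column_index(letters):
    """Convert column letters to a 1-based index (A -> 1, Z -> 26, AA -> 27)."""
    index = 0
    for ch in letters.upper():
        index = index * 26 + (ord(ch) - ord("A") + 1)
    return index


def parse_a1_range(cell_range):
    """Split an A1-style range such as 'B1:E450' into column and row bounds."""
    match = re.fullmatch(r"([A-Za-z]+)(\d+):([A-Za-z]+)(\d+)", cell_range.strip())
    if match is None:
        raise ValueError(f"not an A1 range: {cell_range!r}")
    first_col, first_row, last_col, last_row = match.groups()
    return {
        "first_column": first_col.upper(),
        "last_column": last_col.upper(),
        "first_row": int(first_row),
        "last_row": int(last_row),
        # Number of rows and columns the range covers, inclusive.
        "row_count": int(last_row) - int(first_row) + 1,
        "column_count": column_index(last_col) - column_index(first_col) + 1,
    }


bounds = parse_a1_range("B1:E450")
```

Here `bounds` describes columns B through E and a cap of 450 rows, which are the values you would then enter in whatever tool reads the file.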
I noticed that whenever I export from somewhere like Salesforce and a Phone value looks like +123124141,
the data in Excel becomes =123124141, #NAME?, or some other Excel error.
This makes data patching extremely painful and manual. Any ideas how to get around this?
When you create the file, use an extension other than .csv or .txt (Excel takes these as a green light to interpret columns as numeric, date, etc. and format them accordingly). Use .DAT, for example. When you open the file from within Excel, it will invoke the Text Import Wizard. There you should choose comma separated (or tab separated, etc., as the case may be), and on the final screen of the wizard, select the columns you don't want reformatted (you can select all of them, if you like) and choose Text. Now the column values will be kept intact.
Alternatively you can first create a blank workbook/sheet, and use Data From Text to invoke the Wizard and bring the data in the same way.
If you don't want EXCEL to launch and open when you double-click on a .CSV file, without asking questions, you can remove the file-association (of CSV with EXCEL) using Windows explorer.
It is reasonably easy to develop a VBA macro to import such delimited files using 'TEXT' data types to automate what the Wizard would do.
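Outside Excel, the same "treat every column as text" idea can be sketched in a few lines of Python. This is only an illustration with made-up sample data; `read_delimited_as_text` is a hypothetical helper name. The standard `csv` module never converts values, so a leading `+` or leading zeros survive unchanged.

```python
import csv
import io


def read_delimited_as_text(text, delimiter=","):
    """Return all rows with every field kept as the original string."""
    return list(csv.reader(io.StringIO(text, newline=""), delimiter=delimiter))


# Made-up sample export; the phone values are the kind Excel tends to mangle.
sample = "Name,Phone\nAcme,+123124141\nGlobex,0044123456\n"
rows = read_delimited_as_text(sample)

for row in rows:
    print(row)
```

Each phone number comes back exactly as it appeared in the file, which is the behaviour the Text column type gives you in the wizard.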
I have a requirement in Talend to read two columns from an xlsx file, but one of the columns I need is a measure whose value is supplied by another Excel file through a formula. As a result, the Talend metadata picks up the formula instead of the corresponding value. The Talend environment is hosted on Unix.
Is there any way to work around this without making any change in the input excel?
Any help would be appreciated.
The basic Talend components are fairly limited when it comes to dealing with Excel files.
I found it much better to use the tFileExcel-Components by Jan Lolling: link to the latest version (30/01/2019).
One of the advantages of these components over the classic Talend ones is that when reading a formula, the component will try to interpret it*.
*If the formula references another sheet or another file it won't work.
I have a text file with a variable number of columns.
I need to convert the variable columns to fixed columns in Excel 2007 (or an Access 2007 database, whichever works better, or any other tool that does it well).
How do I do this?
Thanks!
Holly
From the Data tab, click From Text in the Get External Data group.
This will invoke the Import Wizard to allow you to pick the file. You can tell the Wizard to use the tilde as the separator.
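If you would rather prepare the file before importing it, here is a minimal Python sketch. It assumes the file is tilde-delimited, as the answer suggests; the helper names `pad_to_fixed_columns` and `to_csv_text` are hypothetical. It pads every row to the width of the widest row, so each line ends up with the same number of columns.

```python
import csv
import io


def pad_to_fixed_columns(lines, delimiter="~", fill=""):
    """Split each line on the delimiter and pad short rows with empty fields."""
    rows = [line.rstrip("\r\n").split(delimiter) for line in lines if line.strip()]
    width = max((len(row) for row in rows), default=0)
    return [row + [fill] * (width - len(row)) for row in rows]


def to_csv_text(rows):
    """Serialise the padded rows as comma-separated text."""
    buffer = io.StringIO()
    csv.writer(buffer, lineterminator="\n").writerows(rows)
    return buffer.getvalue()


# Made-up sample lines with 3, 1 and 2 fields.
padded = pad_to_fixed_columns(["a~b~c\n", "d\n", "e~f\n"])
csv_text = to_csv_text(padded)
```

The resulting text has three fields on every line and can be saved and imported into Excel or Access in the usual way.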
Good day!
I need to export the version history of a list item to Excel.
Unfortunately I cannot use this solution, because I'm from Russia and it only supports the Latin alphabet.
So I need to learn how to extract data from the version history for a single list item.
Please help. How is this done?
While I haven't found a clean way of doing this, I have a partial workaround.
Use the Export to Excel option in the List section of the ribbon (make sure the list view you export includes a modified column - thanks T6J2E5).
Save the owssvr.iqy file and open it in Notepad.
Copy just the URL from the file and paste it back into your browser, adding "&IncludeVersions=TRUE".
Save the XML file and open in Excel (or your favorite XML viewer), selecting the "As an XML table" open option.
You'll have to delete the first few columns and rows as they contain the schema data but other than that you should have all the version history (I suggest you add the Version column to the view). You can sort by the Modified column to get a chronological change log of the entire list.
FYI "IncludeVersions=TRUE" should be before the List ID for anyone else that needs this.
spurl/_vti_bin/owssvr.dll?XMLDATA=1&IncludeVersions=TRUE&List={ListID}&View={VIEWID}&RowLimit=0&RootFolder=name
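As a rough sketch, the URL pattern above can be assembled in Python so that the parameter order stays consistent. The function name `build_version_history_url`, the example site address and the IDs are placeholders (assumptions), and the optional RootFolder parameter is left out.

```python
from urllib.parse import urlencode


def build_version_history_url(site_url, list_id, view_id):
    """Build an owssvr.dll URL with IncludeVersions placed before the List ID."""
    params = [
        ("XMLDATA", "1"),
        ("IncludeVersions", "TRUE"),
        ("List", list_id),
        ("View", view_id),
        ("RowLimit", "0"),
    ]
    # Keep the braces around the IDs unescaped, as in the pattern above.
    query = urlencode(params, safe="{}")
    return site_url.rstrip("/") + "/_vti_bin/owssvr.dll?" + query


# Placeholder values; substitute the List and View IDs from your own .iqy file.
url = build_version_history_url(
    "http://sp.example/sites/team/", "{1234-ABCD}", "{5678-EF01}"
)
print(url)
```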
After doing the same, I get an error saying that a semicolon is missing. How can I resolve it?
I have an SSIS package that imports data from an Excel file.
One of the columns has mostly numbers (like 12345), but some values also contain letters (like 123A5).
When I configure the package I can't change the format, and the column is always recognized as double (probably it only looks at the first few rows).
How can I enforce my own data type, so that it reads the column as a string?
I've also tried to modify the package in Visual Studio and add the column in the data conversion step, but it's still not working.
Yes, as you suspect the Excel driver is checking the first few rows to determine the data type. You can override the default of 8 rows using the MaxScanRows property:
http://support.microsoft.com/kb/236605
http://support.microsoft.com/kb/257819
Alternatively, save the Excel file as .csv first. It's much easier to specify data types for text files.
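To see why a mostly numeric column ends up typed as double, here is a deliberately simplified Python illustration of type guessing from a sample of rows. It is not the Excel driver's actual algorithm; `guess_column_type` and the sample data are made up for the example.

```python
def guess_column_type(values, scan_rows=8):
    """Guess 'double' or 'string' by looking only at the first scan_rows values."""

    def is_number(value):
        try:
            float(value)
            return True
        except ValueError:
            return False

    sample = values[:scan_rows]
    return "double" if sample and all(is_number(v) for v in sample) else "string"


# Ten numeric values followed by one mixed value, as in the question.
column = ["12345"] * 10 + ["123A5"]

guess_default = guess_column_type(column)                     # scans 8 rows
guess_full = guess_column_type(column, scan_rows=len(column))  # scans every row
```

With only the first 8 rows scanned the guess is "double", because the value 123A5 is never seen; scanning the whole column gives "string". Declaring the type explicitly for a text file avoids the guess altogether.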