Open XML SDK - Save a template file (.xltx to .xlsx) - excel

I have the following code to open Excel template file and save it as .xlsx file and I get the error below when I try to open the new file. Please help to resolve this.
Excel cannot open the file ‘sa123.xlsx’ because the file format or the extension is not valid. Verify that the file has not been corrupted and that the file extension matches the format of the file.
string templateName = "C:\\temp\\sa123.xltx";
byte[] docAsArray = File.ReadAllBytes(templateName);
using (MemoryStream stream = new MemoryStream())
{
stream.Write(docAsArray, 0, docAsArray.Length); // THIS performs doc copy
File.WriteAllBytes("C:\\temp\\sa123.xlsx", stream.ToArray());
}

In order to do this you will need to use the Open XML SDK 2.0. Below is a snippet of code that worked for me when I tried it:
byte[] byteArray = File.ReadAllBytes("C:\\temp\\sa123.xltx");
using (MemoryStream stream = new MemoryStream())
{
stream.Write(byteArray, 0, (int)byteArray.Length);
using (SpreadsheetDocument spreadsheetDoc = SpreadsheetDocument.Open(stream, true))
{
// Change from template type to workbook type
spreadsheetDoc.ChangeDocumentType(SpreadsheetDocumentType.Workbook);
}
File.WriteAllBytes("C:\\temp\\sa123.xlsx", stream.ToArray());
}
What this code does is it takes your template file and opens it into a SpreadsheetDocument object. The type of this object is Template, but since you want it as a Workbook you call the ChangeDocumentType method to change it from a Template to a Workbook. This will work since the underlying XML is the same between a .xltx and a .xlsx file and it was just the type that was causing you an issue.

Excel sees the .xlsx extension and tries to open it as a worksheet file. But it isn't. It's a template file. When you have a template open in Excel and save it as a .xlsx file, it converts it to the worksheet format. What you are doing is the same as changing the extension in the filename. Try it in Windows Explorer and you will get the same result.
I believe you should be able to accomplish what you want by using the Excel Object Model. I have not used this though.

Related

Reading xls file is returning html tags using library xlsx

I am using xlsx library of nodejs for reading xls file. According to the document the library supports xls file format. On reading I am getting html tags along with it.
I can remove the html tags using regex or replace function but does the library give support to do that as I couldn't find it in the documentation?
Excel File format: Microsoft Excel 97-2003 Worksheet (.xls)
The demo link they have provided in their documentation https://oss.sheetjs.com/sheetjs/ works but when I try to do the same with my code it doesn't give the desired result.
let xlsx = require('xlsx');
let fs = require('fs');
let workBookData = xlsx.readFile('data.xls'); // parses a file
console.log(workBookData);
Here is an image of the result I am getting.
This was an issue/bug in the library. A PR has been created for this and it will be fixed in the new version of the library.

Apache POI appending data to xlsx file when task ran twice

I have a template.xls file that I'm adding data to from some database queries. I add the data and generate a new file named yyyyMMddHHmmss.xls. This works great. The file size is getting large so I'm trying to do the same with an xlsx file. When I generate the file the first time it works great. If I run the process again (even if I restart my java app) it's somehow retaining the last file in memory and appending the data to that file. In both cases it's pulling the source file from template.xls(x) which is an unmodified file.
The code between the two is identical except I'm passing in xlsx instead of xls in the latter case.
ClassLoader classLoader = getClass().getClassLoader();
File file = new File(Objects.requireNonNull(classLoader.getResource("template.xlsx")).getFile());
Workbook workbook = WorkbookFactory.create(file);
// write data
Date date = new Date();
SimpleDateFormat formatter = new SimpleDateFormat("yyyyMMddHHmmss");
String currentDate = formatter.format(date);
FileOutputStream fileOutputStream = new FileOutputStream(currentDate + ".xlsx");
workbook.write(fileOutputStream);
fileOutputStream.close();
workbook.close();
I'm using Java 8u201 and org.apache.poi:poi:4.1.0 (also tried 4.0.1)
As told in Apache POI - FileInputStream works, File object fails (NullPointerException) already, creating a XSSFWorkbook from a File has the disadvantage, that all changes which was made in that workbook always will be stored into that file while XSSFWorkbook.write. This is true even if write writes to another file. But writing explicitly to the same file is not even possible because the File stays open after the workbook was created and so writing into that same file leads to exceptions.
So creating a XSSFWorkbook from a File using
Workbook workbook = WorkbookFactory.create(file);
is not a good idea when file is a *.xlsx file. Instead the Workbook needs to be created using a FileInputstream:
Workbook workbook = WorkbookFactory.create(new FileInputStream(file));
Although the linked SO Q/A is from 2017, the same problem always nor occurs today using apache poi 4.1.0.

Not saving over template

I have an XLS template which I am modifying via Apache POI.
The aim is to modify the template and then email this modified xls spreadsheet.
Once the workbook has been modified accordingly - is there a way to keep this file in memory without saving over the original
POIFSFileSystem fs = new POIFSFileSystem(new FileInputStream("template.xls"));
//do some processing
FileOutputStream fileOut = new FileOutputStream("template.xls");
wb.write();
fileOut.close();
I want to keep this template in tact for the next run. I would have used Apache freemarker but couldnt see xls support.

Using EPPlus 4.501 beta, when I try to save a .xlsx file as a .xlsm file, I get message that the file format is invalid

I'm simply trying to save a .xlsx file as a .xlsm file in preparation to add vba code to it and I get a message saying that the file type is invalid and to check if the file is corrupt.
Here's the code:
var excelFile = new FileInfo(#"wwwroot/SubmissionFiles/23477-1/Financial_Report_0001091596.xlsx");
using (var pck = new ExcelPackage(excelFile))
{
//pck.Workbook.Worksheets.Add("VBA Code");
//pck.Workbook.CreateVBAProject();
pck.SaveAs(new System.IO.FileInfo(#"wwwroot/SubmissionFiles/23477-1/Financial_Report_0001091596.xlsm"));
}
When I attempt to open the file manually, I get the following error message:
Error Message
I'm using Excel 2013 and the code is in VS 2017 targeting .NET Core 2.
You will need to get a code page into that workbook before you can actually save it with the .xlsm extension.
Try adding a new code page with a simple routine.
Try inserting a boilerplate code page.
Then save.

Cannot Retrieve Data from Excel File Created using Aspose.Cells

I create an Excel file (.xlsx) using the Aspose.Cells library. But I'm not able to read the data (retrieve rows) using OleDb commands after that, until I open the file and save it manually. I'm running something as simple as this one:
new OleDbDataAdapter("select * from [Sheet1$]", conn); // etc...
Saving the file increases the size of the file as well. Please note that this happens only with the .xlsx format, for the old .xls everything works fine. I even tried the demo code that they have on their website, but the result is the same. Am I missing something?
It seems you need to set the ExportCellName name property to true before saving to xlsx/xlsm format.
Please see the following sample.
//Create your workbook
Workbook workbook = new Workbook(filePath);
//Do your processing
//Save your workbook with export cell as true
OoxmlSaveOptions opts = new OoxmlSaveOptions();
opts.ExportCellName = true;
workbook.Save("output.xlsx", opts);
Note: I am working as Developer Evangelist at Aspose

Resources