How to split PDF document into small ones

99封情书 提交于 2020-05-16 04:11:18

问题


I need to split a document into several small documents. For example, if document has 7 pages I need to generate 7 pdfs.

In iTextSharp i was using the following code, works pretty well. However, in iText 7 its not possible to do it in the same way.

iTextSharp old code

var reader = new PdfReader(src);
for (int i = 1; i <= reader.NumberOfPages; i++)
{
    var document = new Document();
    var copy = new PdfCopy(document, new FileStream(result + i + ".pdf", FileMode.Create));
    document.Open();
    copy.AddPage(copy.GetImportedPage(reader, i));
    document.Close();
}

iText 7, but not working

First problem
I have found that there is PdfSplitter, wich could split my pdf into small pdfs. However, even my testing pdf has 7 pages and even GetNumberOfPages() returns number 7, number of splitted document is just one.

In this linked documenation is somehow shown how to split document. However, I have no idea how to make similiar method to the one mentioned - getNextPdfWriter

Second problem
Even I have one file, its empty. I am wondering how to set proper writer to create correct pdf. Respectively, how to set reader in order to read the content of splitted document.

string result = outputPath + @"/page00";
using (pdfDocument = new PdfDocument(new PdfReader(pdfPath)))
{

    var splitter = new PdfSplitter(pdfDocument);
    var splittedDocs = splitter.SplitByPageCount(pdfDocument.GetNumberOfPages());

    for (int i = 0; i < pdfDocument.GetNumberOfPages(); i++)
    {
        //how to set reader to read the content of splitted docs. Or how to set writer for splitted doc.
        var pdfDoc = new PdfDocument(new PdfWriter(new FileStream(result + i + ".pdf", FileMode.Create)));
        pdfDoc.Close();
        splittedDocs[i].Close();
    }
}

Question

How to properly split document into small ones in .NET core with iText 7


回答1:


Well, it was quite easy. According to linked documentation I did the following:

Create custom splitter overriding functionality of PdfSplitter.

class CustomSplitter : PdfSplitter
    {
        private int _order;
        private readonly string _destinationFolder;

        public CustomSplitter(PdfDocument pdfDocument, string destinationFolder) : base(pdfDocument)
        {
            _destinationFolder = destinationFolder;
            _order = 0;
        }

        protected override PdfWriter GetNextPdfWriter(PageRange documentPageRange)
        {
            return new PdfWriter(_destinationFolder + "splitDocument1_" +  _order++ + ".pdf");
        }
    }

Then just use it for splitting a PDF document. Dont forget to close splitted document. And I would like to point out one thing. SplitByPageCount - it takes number according to wich should be splitted. SplitByPageCount(1) split a PDF document by one page. I trully misunderstood this method.

    using (var pdfDoc = new PdfDocument(new PdfReader("doc.pdf")))
    {

        var outputDir = @"C:\";
        var splitter = new CustomSplitter(pdfDoc, outputDir);
        var splittedDocs = splitter.SplitByPageCount(1);

        foreach (var splittedDoc in splittedDocs)
        {
            splittedDoc.Close();
        }
    }

The result is a few pdfs.



来源:https://stackoverflow.com/questions/49930344/how-to-split-pdf-document-into-small-ones

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!