Read from word document line by line

前端 未结 3 1634
孤独总比滥情好
孤独总比滥情好 2020-11-27 18:25


I\'m trying to read a word document using C#. I am able to get all text but I want to be able to read line by line and store in a list and bind

相关标签:
3条回答
  • 2020-11-27 19:00

    Ok. I found the solution here.


    The final code is as follows:

    Application word = new Application();
    Document doc = new Document();
    
    object fileName = path;
    // Define an object to pass to the API for missing parameters
    object missing = System.Type.Missing;
    doc = word.Documents.Open(ref fileName,
            ref missing, ref missing, ref missing, ref missing,
            ref missing, ref missing, ref missing, ref missing,
            ref missing, ref missing, ref missing, ref missing,
            ref missing, ref missing, ref missing);
    
    String read = string.Empty;
    List<string> data = new List<string>();
    for (int i = 0; i < doc.Paragraphs.Count; i++)
    {
        string temp = doc.Paragraphs[i + 1].Range.Text.Trim();
        if (temp != string.Empty)
            data.Add(temp);
    }
    ((_Document)doc).Close();
    ((_Application)word).Quit();
    
    GridView1.DataSource = data;
    GridView1.DataBind();
    
    0 讨论(0)
  • 2020-11-27 19:04

    The above code is correct, but it's too slow. I have improved the code, and it's much faster than the above one.

    List<string> data = new List<string>();
    Application app = new Application();
    Document doc = app.Documents.Open(ref readFromPath);
    
    foreach (Paragraph objParagraph in doc.Paragraphs)
        data.Add(objParagraph.Range.Text.Trim());
    
    ((_Document)doc).Close();
    ((_Application)app).Quit();
    
    0 讨论(0)
  • 2020-11-27 19:09

    How about this yo. Get all the words from the doc and split them on return or whatever is better for you. Then turn into list

       List<string> lines = doc.Content.Text.Split('\n').ToList();
    
    0 讨论(0)
提交回复
热议问题