Reading .doc file without launching MSWord

后端 未结 3 857
广开言路
广开言路 2021-01-19 10:34

I\'m trying to open .doc file and read its content. But i can\'t find any way how to do this without launching MSWord.

Now I have following code:

Mic         


        
相关标签:
3条回答
  • 2021-01-19 11:11

    Add the Namespace using Add Reference-->Browse-->Code7248.word_reader.dll

    Download dll from the given URL :

    sourceforge.net/p/word-reader/wiki/Home

    (A simple .NET Library compatible with .NET 2.0, 3.0, 3.5 and 4.0 for C#. It can currently extract only the raw text from a .doc or .docx file.)

    The Sample Code is in simple Console in C#:

    using System;
    using System.Collections.Generic;
    using System.Text;
    //add extra namespaces
    using Code7248.word_reader;
    
    
    namespace testWordRead
    {
        class Program
        {
            private void readFileContent(string path)
            {
                TextExtractor extractor = new TextExtractor(path);
                string text = extractor.ExtractText();
                Console.WriteLine(text);
            }
            static void Main(string[] args)
            {
                Program cs = new Program();
                string path = "D:\Test\testdoc1.docx";
                cs.readFileContent(path);
                Console.ReadLine();
            }
        }
    }
    

    It is working fine.

    0 讨论(0)
  • 2021-01-19 11:18

    Two possibilities: either use Microsoft's spec to write your own parser for the .doc format, or use an existing library for the purpose (e.g., from Aspose). Unless you have a couple of spare years to spend on the task, the latter is clearly the correct choice.

    0 讨论(0)
  • 2021-01-19 11:25

    Last time I did this (via COM from C++), I recall a 'Visible' property in the Application interface (true=visible).

    However, it seems to me that the default was false, so you had to set it to true to make Word appear.

    Regardless of whether or not the user can see Word, you will still see winword.exe (or whatever it's called today) in your task manager. I don't think there's a way to access Word through this interface without it launching Word (behind the scenes or not).

    If you don't want Word to launch at all, you may have to find another solution.

    0 讨论(0)
提交回复
热议问题