How can I convert HTML to Textile?

既然无缘 2021-02-14 01:05

I'm scraping a static HTML site and moving the content into a database-backed CMS. I'd like to use Textile in the CMS.

Is there a tool out there that converts HTML to Textile?

5 Answers
  • 2021-02-14 01:29

    I know this is an old question, but I found myself trying to do this the other day and not finding anything useful, until I found Pandoc. It can convert loads of other markup formats as well - it's quite brilliant.
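
    For a batch migration, Pandoc can be driven from a program as well as from the shell. The following is a minimal sketch, assuming `pandoc` is installed and on the `PATH`; the class name `PandocTextile` and the file names are placeholders, not part of the original answer. It runs the equivalent of `pandoc -f html -t textile -o page.textile page.html`.

```java
import java.io.IOException;
import java.util.List;

class PandocTextile {

    // Builds the command line: read HTML from inputPath, write Textile to outputPath.
    static List<String> command(String inputPath, String outputPath) {
        return List.of("pandoc", "-f", "html", "-t", "textile", "-o", outputPath, inputPath);
    }

    // Runs pandoc and returns its exit code (0 means success).
    static int convert(String inputPath, String outputPath)
            throws IOException, InterruptedException {
        Process process = new ProcessBuilder(command(inputPath, outputPath))
                .inheritIO() // show pandoc's warnings and errors on our console
                .start();
        return process.waitFor();
    }

    public static void main(String[] args) throws Exception {
        // page.html and page.textile are placeholder file names.
        int exitCode = convert("page.html", "page.textile");
        System.out.println("pandoc exited with code " + exitCode);
    }
}
```

    Calling `convert` once per scraped file keeps the conversion step separate from the scraping and database code, so the output can be inspected before it is loaded into the CMS.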

  • 2021-02-14 01:37

    Here is a C# library that converts HTML to Textile. Note that it produces Textile with the library's own additions, not pure Textile.

  • 2021-02-14 01:40

    Since there was no JavaScript implementation, I wrote one: https://github.com/cmroanirgo/to-textile

    It's a little primitive at the moment, as it's a blind port of the 'to-markdown' equivalent, but should get the job done.

  • 2021-02-14 01:42

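    Try this simple Java code; I hope it works for you. Note that it only downloads a page and saves the raw HTML to a file; it does not convert anything to Textile.

    import java.io.*;
    import java.net.URL;

    class Crawle
    {
        public static void main(String[] args) throws Exception
        {
            URL url = new URL("https://www.google.co.in/#q=i+am+happy");

            // Make sure the output directory exists before opening the file.
            new File("crawler").mkdirs();

            // try-with-resources closes the reader and the writer on exit.
            try (BufferedReader br = new BufferedReader(new InputStreamReader(url.openStream()));
                 PrintWriter pr = new PrintWriter(new FileOutputStream("crawler/file.txt"), true))
            {
                String data;
                // Copy the page line by line to the file and to the console.
                while ((data = br.readLine()) != null)
                {
                    pr.println(data);
                    System.out.println(data);
                }
            }
        }
    }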
    
  • 2021-02-14 01:48

    This is a simple markup replacement, nothing a good regex could not fix.

    I recommend Perl with LWP::Simple and some regexes to do the whole thing: spidering, stripping design and menus, converting to Textile, and then posting to the database.
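
    The regex approach can be sketched as follows. The answer suggests Perl; this sketch uses Java instead, to match the other code on this page. The class name `HtmlToTextile` and the set of handled tags are assumptions chosen for illustration. It covers only headings, paragraphs, links, bold, italic, and line breaks, does not decode HTML entities, and will likely mishandle nested or malformed markup.

```java
import java.util.regex.Pattern;

class HtmlToTextile {

    // Match tags case-insensitively and let '.' span line breaks.
    private static final int FLAGS = Pattern.CASE_INSENSITIVE | Pattern.DOTALL;

    private static String replace(String input, String regex, String replacement) {
        return Pattern.compile(regex, FLAGS).matcher(input).replaceAll(replacement);
    }

    static String convert(String html) {
        String s = html;
        // Headings: <h2>Title</h2> becomes "h2. Title" followed by a blank line.
        s = replace(s, "<h([1-6])(?:\\s[^>]*)?>(.*?)</h\\1>", "h$1. $2\n\n");
        // Links: <a href="url">text</a> becomes "text":url
        s = replace(s, "<a\\s+[^>]*?href=\"([^\"]*)\"[^>]*>(.*?)</a>", "\"$2\":$1");
        // Bold and italic: *strong* and _emphasis_
        s = replace(s, "<(strong|b)>(.*?)</\\1>", "*$2*");
        s = replace(s, "<(em|i)>(.*?)</\\1>", "_$2_");
        // Paragraphs become plain text blocks separated by blank lines.
        s = replace(s, "<p(?:\\s[^>]*)?>(.*?)</p>", "$1\n\n");
        // Explicit line breaks.
        s = replace(s, "<br\\s*/?>", "\n");
        // Drop any tag that was not handled above (menus, divs, spans, ...).
        s = replace(s, "<[^>]+>", "");
        return s.trim();
    }

    public static void main(String[] args) {
        String html = "<h1>Title</h1><p>Some <b>bold</b> text, see "
                + "<a href=\"http://example.com\">the site</a></p>";
        System.out.println(convert(html));
    }
}
```

    The order of the replacements matters: the inline tags are rewritten before the catch-all rule strips whatever is left. For anything beyond simple, regular pages, a real HTML parser or a dedicated converter is the safer choice.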
