Convert a simple mono drawing image to a 2d text array

后端 未结 1 1968
孤独总比滥情好
孤独总比滥情好 2021-01-23 14:25

I am trying to develop an algorithm that converts simple mono line images ie Maze, to a text 2d array.

For example, the image below, it would be converted to the followin

1条回答
  •  佛祖请我去吃肉
    2021-01-23 14:47

    Interseting problem its basically vector form of Image to ASCII art conversion... I managed to do this with this algorithm:

    1. preprocess image

      You gave us JPG which has lossy compresion meaning your image contain much more than just 2 colors. So there are shades and artifacts which will screw things up. So first we must get rid of those by thresholding and recoloring. So we can have 2D BW image (no grayscales)

    2. vectorize

      Your maze is axis aligned so it contains only horizontal and vertical (h,v) lines. So simply scan each line of image find first starting wall pixel then its ending pixel and store somewhere... repeat until whole line is processed and do this for all lines. Again do the same for rows of image. As your image has thick walls ignore lines sorter than thickness threshold and remove adjacent (duplicates) line that are (almost) the same.

    3. get list of possible grid coordinates from h,v lines

      simply make a list of all x and y (separately) coordinates from lines start and end points. Then sort them and remove too close coordinates (duplicates).

      Now the min and max values gives you AABB of your maze and GCD of all the coordinate-lowest coordinate will give you grid size.

    4. align h,v lines to grid

      simply round all start/end points to nearest grid position ...

    5. create text buffer for maze

      AABB along with grid size will give you resolution of your maz in cells so simply create 2D text buffer where each cell has NxN characters. I am using 6x3 cells which looks nice enough (square and with enough space inside).

    6. renmder h,v lines into text

      simply loop through all lines and render - or | instead of pixels... I am using also + if the target position does not contain ' '.

    7. convert 2D text array into wanted text output

      simply copy the lines into single text ... or if you clever enough you can have 1D and 2D at the same memory place with eol encoded between lines.

    Here simple example in C++/VCL I made from the exampe in the link above:

    //---------------------------------------------------------------------------
    #include 
    #include 
    #pragma hdrstop
    
    #include "win_main.h"
    #include "List.h"
    //---------------------------------------------------------------------------
    #pragma package(smart_init)
    #pragma resource "*.dfm"
    TForm1 *Form1;
    Graphics::TBitmap *bmp=new Graphics::TBitmap;
    int txt_xs=0,txt_ys=0,txt_xf=0;
    //---------------------------------------------------------------------------
    template  void sort_asc_bubble(T *a,int n)
        {
        int i,e; T a0,a1;
        for (e=1;e;n--)                                     // loop until no swap occurs
         for (e=0,a0=a[0],a1=a[1],i=1;ia1)                                        // condition if swap needed
          { a[i-1]=a1; a[i]=a0; a1=a0; e=1; }               // swap and allow to process array again
        }
    //---------------------------------------------------------------------------
    AnsiString bmp2lintxt(Graphics::TBitmap *bmp)
        {
        bool debug=false;
        const int cx=6;             // cell size
        const int cy=3;
        const int thr_bw=400;       // BW threshold
        const int thr_thickness=10; // wall thikness threshold
        char a;
        AnsiString txt="",eol="\r\n";
        int x,y,x0,y0,x1,y1,xs,ys,gx,gy,nx,ny,i,i0,i1,j;
        union { BYTE db[4]; DWORD dd; } c; DWORD **pyx;
        List h,v;  // horizontal and vertical lines (x,y,size)
        List tx,ty;// temp lists for grid GCD computation
        // [init stuff]
        bmp->HandleType=bmDIB;
        bmp->PixelFormat=pf32bit;
        xs=bmp->Width ;
        ys=bmp->Height;
        if (xs<=0) return txt;
        if (ys<=0) return txt;
        pyx=new DWORD*[ys];
        for (y=0;yScanLine[y];
        i=xs; if (i=thr_bw) c.dd=0x00FFFFFF;
            else           c.dd=0x00000000;
            pyx[y][x]=c.dd;
            }
        if (debug) bmp->SaveToFile("out0_bw.bmp");
        // [vectorize]
        // get horizontal lines
        i0=0; i1=0; h.num=0;
        for (y0=0;y0thr_thickness)
                    {
                    h.add(x0);
                    h.add(y0);
                    h.add(i);
                    }
                x0=x1+1;
                }
            // remove duplicate lines
            for (i=i0;ithr_thickness)
                    {
                    v.add(x0);
                    v.add(y0);
                    v.add(i);
                    }
                y0=y1+1;
                }
            // remove duplicate lines
            for (i=i0;ix) x0=x; if (x1=0) tx.add(x);
            if (y0>y) y0=y; if (y1=0) ty.add(y);
            x+=h[i+2];
            if (x0>x) x0=x; if (x1=0) tx.add(x);
            }
        for (i=0;ix) x0=x; if (x1=0) tx.add(x);
            if (y0>y) y0=y; if (y1=0) ty.add(y);
            y+=v[i+2];
            if (y0>y) y0=y; if (y1=0) ty.add(y);
            }
        // order tx,ty
        sort_asc_bubble(tx.dat,tx.num);
        sort_asc_bubble(ty.dat,ty.num);
        // remove too close coordinates
        for (i=1;ix) gx=x; } nx=(x1-x0+1)/gx; gx=(x1-x0+1)/nx; x1=x0+nx*gx;
        for (gy=y1-y0,i=1;iy) gy=y; } ny=(y1-y0+1)/gy; gy=(y1-y0+1)/ny; y1=y0+ny*gy;
        // align x,y to grid: multiplicate nx,ny by cx,cy to form boxes and enlarge by 1 for final border lines
        nx=(cx*nx)+1;
        ny=(cy*ny)+1;
        // align h,v lines to grid
        for (i=0;i>1))/gx)*gx; h[i+0]=x+x0;
            y=h[i+1]-y0; y=((y+(gy>>1))/gy)*gy; h[i+1]=y+y0;
            j=h[i+2];    j=((j+(gx>>1))/gx)*gx; h[i+2]=j;
            }
        for (i=0;i>1))/gx)*gx; v[i+0]=x+x0;
            y=v[i+1]-y0; y=((y+(gy>>1))/gy)*gy; v[i+1]=y+y0;
            j=v[i+2];    j=((j+(gy>>1))/gy)*gy; v[i+2]=j;
            }
        // [h,v lines -> ASCII Art]
        char *text=new char[nx*ny];
        char **tyx=new char*[ny];
        for (y=0;yCanvas->Pen->Color=TColor(0x000000FF);
        for (i=1,x=x0;i;x+=gx)
            {
            if (x>=x1){ x=x1; i=0; }
            bmp->Canvas->MoveTo(x,y0);
            bmp->Canvas->LineTo(x,y1);
            }
        for (i=1,y=y0;i;y+=gy)
            {
            if (y>=y1){ y=y1; i=0; }
            bmp->Canvas->MoveTo(x0,y);
            bmp->Canvas->LineTo(x1,y);
            }
        if (debug) bmp->SaveToFile("out1_grid.bmp");
        // h,v lines
        bmp->Canvas->Pen->Color=TColor(0x00FF0000);
        bmp->Canvas->Pen->Width=2;
        for (i=0;iCanvas->MoveTo(x,y);
            bmp->Canvas->LineTo(x+j,y);
            }
        for (i=0;iCanvas->MoveTo(x,y);
            bmp->Canvas->LineTo(x,y+j);
            }
        bmp->Canvas->Pen->Width=1;
        if (debug) bmp->SaveToFile("out2_maze.bmp");
    
        delete[] pyx;
        return txt;
        }
    //---------------------------------------------------------------------------
    void update()
        {
        int x0,x1,y0,y1,i,l;
        x0=bmp->Width;
        y0=bmp->Height;
        // Font size
        Form1->mm_txt->Font->Size=Form1->cb_font->ItemIndex+4;
        txt_xf=abs(Form1->mm_txt->Font->Size);
        // mode
        Form1->mm_txt->Text=bmp2lintxt(bmp);
        // output
        Form1->mm_txt->Lines->SaveToFile("pic.txt");
        x1=txt_xs*txt_xf;
        y1=txt_ys*abs(Form1->mm_txt->Font->Height);
        if (y0flb_pic->Width;
        y0+=Form1->pan_top->Height;
        if (x0<340) x0=340;
        if (y0<128) y0=128;
        Form1->ClientWidth=x0;
        Form1->ClientHeight=y0;
        Form1->Caption=AnsiString().sprintf("Picture -> Text ( Font %ix%i )",abs(Form1->mm_txt->Font->Size),abs(Form1->mm_txt->Font->Height));
        }
    //---------------------------------------------------------------------------
    void draw()
        {
        Form1->ptb_gfx->Canvas->Draw(0,0,bmp);
        }
    //---------------------------------------------------------------------------
    void load(AnsiString name)
        {
        if (name=="") return;
        AnsiString ext=ExtractFileExt(name).LowerCase();
        if (ext==".bmp")
            {
            bmp->LoadFromFile(name);
            }
        if (ext==".jpg")
            {
            TJPEGImage *jpg=new TJPEGImage;
            jpg->LoadFromFile(name);
            bmp->Assign(jpg);
            delete jpg;
            }
        bmp->HandleType=bmDIB;
        bmp->PixelFormat=pf32bit;
        Form1->ptb_gfx->Width=bmp->Width;
        Form1->ClientHeight=bmp->Height;
        Form1->ClientWidth=(bmp->Width<<1)+32;
        }
    //---------------------------------------------------------------------------
    __fastcall TForm1::TForm1(TComponent* Owner):TForm(Owner)
        {
        }
    //---------------------------------------------------------------------------
    void __fastcall TForm1::FormDestroy(TObject *Sender)
        {
        delete bmp;
        }
    //---------------------------------------------------------------------------
    void __fastcall TForm1::FormPaint(TObject *Sender)
        {
        draw();
        }
    //---------------------------------------------------------------------------
    void __fastcall TForm1::flb_picChange(TObject *Sender)
        {
        load(flb_pic->FileName);
        update();
        }
    //---------------------------------------------------------------------------
    void __fastcall TForm1::FormActivate(TObject *Sender)
        {
        flb_pic->SetFocus();
        flb_pic->Update();
        if (flb_pic->ItemIndex==-1)
         if (flb_pic->Items->Count>0)
            {
            flb_pic->ItemIndex=0;
            flb_picChange(this);
            }
        }
    //---------------------------------------------------------------------------
    

    Just ignore the VCL stuff and convert the resulting text into whatever you have at disposal. I also use mine dynamic list template so:


    List xxx; is the same as double xxx[];
    xxx.add(5); adds 5 to end of the list
    xxx[7] access array element (safe)
    xxx.dat[7] access array element (unsafe but fast direct access)
    xxx.num is the actual used size of the array
    xxx.reset() clears the array and set xxx.num=0
    xxx.allocate(100) preallocate space for 100 items

    So use whatever list you got or recode or use std::vector instead...

    I edited out the texts from your image:

    And this is the result using that as input:

    +-----------+------------------     |
    |           |                       |
    |           |                       |
    |     |     |     |     +-----------+
    |     |           |     |           |
    |     |           |     |           |
    |     +-----------+     +-----+     |
    |                 |           |     |
    |                 |           |     |
    +------     |     +-----+     |     |
    |           |           |           |
    |           |           |           |
    |     ------+-----------+------     |
    |                                   |
    |                                   |
    |     ------------------------------+
    

    And here the saved debug bitmaps (from left to right: BW,Grid,Maze):

    The only important stuff from the code is function:

    AnsiString bmp2lintxt(Graphics::TBitmap *bmp);
    

    Which returns text from VCL (GDI based) bitmap.

    0 讨论(0)
提交回复
热议问题