How do I get the web page contents from a WebView?

后端 未结 7 1863
北恋
北恋 2020-11-22 06:19

On Android, I have a WebView that is displaying a page.

How do I get the page source without requesting the page again?

It seems WebView

相关标签:
7条回答
  • 2020-11-22 07:05

    This is an answer based on jluckyiv's, but I think it is better and simpler to change Javascript as follows.

    browser.loadUrl("javascript:HTMLOUT.processHTML(document.documentElement.outerHTML);");
    
    0 讨论(0)
  • 2020-11-22 07:10

    I know this is a late answer, but I found this question because I had the same problem. I think I found the answer in this post on lexandera.com. The code below is basically a cut-and-paste from the site. It seems to do the trick.

    final Context myApp = this;
    
    /* An instance of this class will be registered as a JavaScript interface */
    class MyJavaScriptInterface
    {
        @JavascriptInterface
        @SuppressWarnings("unused")
        public void processHTML(String html)
        {
            // process the html as needed by the app
        }
    }
    
    final WebView browser = (WebView)findViewById(R.id.browser);
    /* JavaScript must be enabled if you want it to work, obviously */
    browser.getSettings().setJavaScriptEnabled(true);
    
    /* Register a new JavaScript interface called HTMLOUT */
    browser.addJavascriptInterface(new MyJavaScriptInterface(), "HTMLOUT");
    
    /* WebViewClient must be set BEFORE calling loadUrl! */
    browser.setWebViewClient(new WebViewClient() {
        @Override
        public void onPageFinished(WebView view, String url)
        {
            /* This call inject JavaScript into the page which just finished loading. */
            browser.loadUrl("javascript:window.HTMLOUT.processHTML('<head>'+document.getElementsByTagName('html')[0].innerHTML+'</head>');");
        }
    });
    
    /* load a web page */
    browser.loadUrl("http://lexandera.com/files/jsexamples/gethtml.html");
    
    0 讨论(0)
  • 2020-11-22 07:11

    Per issue 12987, Blundell's answer crashes (at least on my 2.3 VM). Instead, I intercept a call to console.log with a special prefix:

    // intercept calls to console.log
    web.setWebChromeClient(new WebChromeClient() {
        public boolean onConsoleMessage(ConsoleMessage cmsg)
        {
            // check secret prefix
            if (cmsg.message().startsWith("MAGIC"))
            {
                String msg = cmsg.message().substring(5); // strip off prefix
    
                /* process HTML */
    
                return true;
            }
    
            return false;
        }
    });
    
    // inject the JavaScript on page load
    web.setWebViewClient(new WebViewClient() {
        public void onPageFinished(WebView view, String address)
        {
            // have the page spill its guts, with a secret prefix
            view.loadUrl("javascript:console.log('MAGIC'+document.getElementsByTagName('html')[0].innerHTML);");
        }
    });
    
    web.loadUrl("http://www.google.com");
    
    0 讨论(0)
  • 2020-11-22 07:12

    If you are working on kitkat and above, you can use the chrome remote debugging tools to find all the requests and responses going in and out of your webview and also the the html source code of the page viewed.

    https://developer.chrome.com/devtools/docs/remote-debugging

    0 讨论(0)
  • 2020-11-22 07:15

    Have you considered fetching the HTML separately, and then loading it into a webview?

    String fetchContent(WebView view, String url) throws IOException {
        HttpClient httpClient = new DefaultHttpClient();
        HttpGet get = new HttpGet(url);
        HttpResponse response = httpClient.execute(get);
        StatusLine statusLine = response.getStatusLine();
        int statusCode = statusLine.getStatusCode();
        HttpEntity entity = response.getEntity();
        String html = EntityUtils.toString(entity); // assume html for simplicity
        view.loadDataWithBaseURL(url, html, "text/html", "utf-8", url); // todo: get mime, charset from entity
        if (statusCode != 200) {
            // handle fail
        }
        return html;
    }
    
    0 讨论(0)
  • 2020-11-22 07:17

    I managed to get this working using the code from @jluckyiv's answer but I had to add in @JavascriptInterface annotation to the processHTML method in the MyJavaScriptInterface.

    class MyJavaScriptInterface
    {
        @SuppressWarnings("unused")
        @JavascriptInterface
        public void processHTML(String html)
        {
            // process the html as needed by the app
        }
    }
    
    0 讨论(0)
提交回复
热议问题