icepdf

在Java项目中使用Elasticsearch 6.x (一)对索引的增删改查

ぃ、小莉子 提交于 2020-08-07 04:08:01
一.项目包含的功能 1.高亮搜索; 2.词语自动补全; 3.分页查询; 4.复合查询; 5.对索引的增、删、改、查; 6.对文档的增、删、改; 7.搜索结果排序; 8.其他功能:文件读取(word、txt、pdf) 二.依赖环境版本 1.Elasticsearch 6.6.1; 2.jdk 1.8 3.前端分页插件:网上找的 4.elasticsearch-rest-high-level-client 6.6.1 5.spring boot 2.1.3 6.kibana 6.6.0 三.访问路径 1.项目路径:http://localhost:8080/searchTest.html 2.kibana路径:http://localhost:5601 3.Elasticsearch启动检测路径:http://localhost:9200/ 四.页面效果 五.代码 1.bean实体类 package com.demo.elasticsearch.bean; import java.util.Date; /** * @Author: ln * @Date: 2019/2/26 08:59 * @Description: */ public class FileBean { // text支持分词搜索的字段有:name,author,content,filePath //

java-使用Jacob实现office转换成pdf

不羁岁月 提交于 2020-05-02 04:34:23
注意:com的线程回收不由java垃圾回收器进行处理,因此,每new一次jacob提供的类就要分配一定大小的内存给该操作,new出来的这个com对象在使用结束之后产生的垃圾java是无法回收的,new出来的对象越来越多,内存溢出就不可避免了 https://blog.csdn.net/u011783999/article/details/50897672?tdsourcetag=s_pcqq_aiomsg https://men4661273.iteye.com/blog/2097871 参考路径: https://blog.csdn.net/csdnFlyFun/article/details/79523262#commentBox Jacob组件下载地址: https://sourceforge.net/projects/jacob-project/ Jacob 介绍 Jacob 是 JAVA-COM Bridge的缩写,是一个中间件,能够提供自动化访问MS系统下COM组件和Win32 libraries的功能。 MS系统提供的COM组件 COM组件 对象ID Microsoft Word Word.Application Microsoft Excel Excel.Application Microsoft Powerpoint Powerpoint.Application

Extracting text in a specific region of PDF page using ICEpdf

亡梦爱人 提交于 2019-12-24 07:06:52
问题 Is there any way to extract the text of a specific region using ICEpdf? I was able to extract whole pages, but that's not what I want to do. (I know PDFBox nicely extracts the text in a specific rectangular area of a page. However, since the image rendering works a lot better in ICEpdf, I'd like to use that library.) 回答1: ON the Page object that represents a page you can call the method: PageText pageText = document.getPageText(pagNumber); Similar to the bundle example ./examples/extraction

Icepdf special character rendering issue

丶灬走出姿态 提交于 2019-12-12 17:11:03
问题 I use itext library for creating PDF file because has very detailed rendering functions for PDF creation. When user click the button i write a template and fill the blank cells from DB everytime. Than i use Icepdf library for show to user and taking output of the created pdf file. But Icepdf has some character encoding problem i think. When PDf created and callled by Icepdf one of Turkish character looks as square. Turkish characters can be seen at this link. All characters rendered