pdi

Unable to connect steps in Pentaho PDI installed on Windows 10

妖精的绣舞 提交于 2019-12-23 17:26:03
问题 I installed a fresh copy of Windows 10 (Education version)and discovered that when I run Pentaho PDI (version 5.4) that I am unable to connect two steps together. Normally there is a dialog window that opens where you can select the add hop but nothing. I installed the JRE and JDK versions of Java. No other issues that I have nothing with other application using Java. I did make the discovery this evening if I place two steps on the work surface and then attempt to execute the transformation,

Error setting up initial repository in Pentaho Kettle

只谈情不闲聊 提交于 2019-12-23 05:16:24
问题 I'm setting up pentaho for the first time. It is able to see mysql but when I try and set up the initial repository it give me this error: org.pentaho.di.core.exception.KettleAuthException: Incorrect password or login Seems to want an admin (username) password. Don't see anything about this in the docs and web searches have not been fruitful. Any help appreciated. 回答1: If you want to create a new Kettle database repository using MySql, you need to follow some setup instructions. You can try

using variable names for a database connection in Pentaho Kettle

好久不见. 提交于 2019-12-22 00:09:29
问题 I am working on PDI kettle. Can we define a variable and use it in a database connection name. So that if in future if i need to change the connections in multiple transformations i would just change the variable value in kettle properties file? 回答1: Just use variables in the Database Connection . For instance ${DB_HostName} , and ${DB_Name} etc. Then just put it in your kettle.properties: DB_HostName=localhost You can see what fields that support variables by the S in the blue diamond. 来源:

pentaho spoon/pid: how to move files to folders with different name everytime?

穿精又带淫゛_ 提交于 2019-12-20 05:28:23
问题 I have new text files every month from where I extract the data and do some transformations. In the end of every month, I need to move these files to a folder with current date in name. Which means, the destination folder's name is different everytime. I made a step before move files that creates a folder and its name is current date (exp: 2019-06-01, 2019-07-01), but then on move files step, I don't know how to specify the destination folder. Guess "wildcard" is only used for source...

Pentaho DI - JSON Nested File Output

痴心易碎 提交于 2019-12-19 10:23:05
问题 I have a requirement where I need to fetch records from multiple tables. The primary table is having one-to-many relationship to other tables. My data source is Oracle DB. Oracle db is having the specified tables. One called Student other one is Subjects. For sample, I have a Student Table where "Student_Id" is the Primary Key and other columns like firstname, lastName etc. Each student have registered for multiple subjects so we have student_id is the foreign key to the Subjects table.

Running pan.bat from command line

耗尽温柔 提交于 2019-12-13 04:57:18
问题 I'm trying to run pan.bat through cmd from my windows os system,I have set the environment variable PENTAHO_JAVA_HOME,seeking help for the same , Thanking in advance. I tried this command to run the .ktr C:\pdi-ce-5.2.0.0-209\data-integration>pan.bat /file:E:\Practise_TRANSFORMATION OUTPUT\dynamic pivot\trying_pivot_with_2_billingid.ktr /level:Basic and this is the error I'm getting WARNING: Using java from path DEBUG: _PENTAHO_JAVA_HOME= DEBUG: _PENTAHO_JAVA=java.exe C:\pdi-ce-5.2.0.0-209

Pivots using ETL Metadata Injection

眉间皱痕 提交于 2019-12-13 01:27:24
问题 It's quite simple to use row denormaliser to achieve pivots when we have few records which can be written manually in denorm step,but what when there's hundreds of thousands of records? I tried using etl metadata injection step, but I was unable to achieve my desired output. Here is the link to my previous post where my source data has been defined. and here is what I have tried row denormaliser options pivot_inject_etl_metadata.ktr 回答1: Try something like this: pivot_inject_etl_metadata.ktr

error while running pan.bat in command line

若如初见. 提交于 2019-12-12 04:18:30
问题 with the reference to my previous post,here is the link I'm still yet facing with some errors. I tried running other .ktr file it ran successfully but when I'm trying to run "pivot_inject_etl_metadata.ktr" it gives me following error C:\pdi-ce-5.2.0.0-209\data-integration>pan.bat /file:E:\Practise_TRANSFORMATION_OUTPUT\dynamic_pivot\pivot_inject_etl_metadata.ktr /level:Basic DEBUG: Using PENTAHO_JAVA_HOME DEBUG: _PENTAHO_JAVA_HOME=C:\Program Files\Java\jre1.8.0_51 DEBUG: _PENTAHO_JAVA=C:

PDI - Read CSV Files, if missing field/data then move to the next file

核能气质少年 提交于 2019-12-11 17:49:07
问题 I'm new with PDI and still learn about it. I'm trying to create transformation that will read all the csv file from one folder, check if the data of the file is correct, meaning there is no rows with missing/error/wrong format, then store it in a database. What I have try is : Use Text File Input accessing CSV file in FTP using Apache Common VFS. Validate and make condition to check the data (checking filename, field if exist) in CSV using Filter Row Output into PostgreSQL Table using

pentaho spoon/PDI : how to create a folder with current date?

跟風遠走 提交于 2019-12-11 17:20:55
问题 As the title says, I'd like to know how to create a folder with current date(dd/mm/yyyy) (on Job). I've seen two answers but they seemed quite long, for example this one: pentaho create archive folder with MM-YYYY Anyone has an idea? Thanks :) 回答1: That answer is also from 2014, better and newer versions of PDI are out, and this answer is overcomplicating the task. Though you do need to create the folder name in a KTR, you can just use the 'Create folder' Job step. I'm pretty sure you cannot