Tuesday, April 14, 2015

How to create a simple job from metadata

To start with creating metadata from the CSv file go to the Repository view, right-click on Job Designs and select Create job. 

The pop window New Job opens write the name for example: MyTalendFirstJob and click finish button.

Now you have to drag and drop Student_CSV schema from metadata file delimited on design workspace.

Then it shows this window select tFileInputDelimited and click OK button.

Here your Student input file delimited has been dropped. Or you can simply select tInputFileDelimited component from Palette and drag and drop on design workspace.

Next if you do not want to browse the components  from the Palette you can search the component directly in the search field at the top of the Palette.

Drag and drop tlogrow on design workspace
Right click on Student input file delimited and connect the component pairs using a Row > Main connection .

Now double click on row1(main) link and write output if you want to write.

Then double on Student CSV file input delimited here each component is defined by basic and advanced properties shown respectively on the Basic Settings tab and the Advanced Settings tab of the Component view .

Now double click on tlogrow component

You can click on edit schema button to create your built-in schema by adding columns and describing their content, according to the input file definition.

At last you can run the job by clicking Run Button to start the execution .Then it shows the progress information screen here tLogRow component is used in the Job design to show the job output.

If you want to run a job again then first you have to click the Clear button and if for any reason you want to stop the job then click the Kill button.
How to Run a Job in Traces Debug Mode
Here when you click the Traces Debug then it will display the row by row result in the table format.

Creating schema will help you to re-use it when ever it is required across the job, so it is very important to create the schema under metadata section of the talend which will help you to build the job very quickly if you are going to use the same file or table in more than one job.

No comments:

Post a Comment