CData Sync - CSV Source file questions

Question

Hello, couple of CSV source related questions…

How do I indicate, for a CSV-based source is, what the primary key column is for each table, so that an upsert can be properly done?
Is it possible to move the files to another folder (in the operating system) after Sync has ingested them? We would want to archive the files so they don’t get re-processed the next time. I can do this outside of Sync if necessary, but it would be cleaner if Sync could do this somehow (or call out to a batch file on the OS and run that).
In my case, I have many source files with a timestamp, like MYFILE_XYZ_20231006.TXT. I would like to use the file Aggregation so that I can just pull them all into the same target table. However when I do that, is there a way to pass along the individual filename being processed, into a column in the target table?

Thank you for any assistance you can provide!

Taylor · Accepted Answer

Hi @DougN Nouria,You can setthe Primary Keys of a source table by modifying the query for the Task. For instance, let's say I have a source file called DateTest.csv and I want to set "Col1” as the PK. If you edit the task and navigate to the "Query” tab, you can then edit the query syntax like so:REPLICATE [DateTest.csv] ([Col1] VARCHAR(255), PRIMARY KEY ([Col1])) SELECT * FROM [DateTest.csv]This syntax also supports composite keys:REPLICATE [DateTest.csv] ([Col1] VARCHAR(255), [Col2] INT,PRIMARY KEY ([Col1], [Col2])) SELECT * FROM [DateTest.csv]Another option is to set the UseRowNumbers connection property to true in your CSV Connection. This will automatically add"Row Number” as column and sets that as the Primary Key to the table.CData Sync supports both of those options through Events. Here you can define executions that run before and/orafter a job completes.Here is a code example of running a batch file called myfile.bat:Here is an example that moves a file from one directory to anotherThis requiresusing theLoad from Folderjob type. It enhances theAggregate Filesproperty by adding additional metadata to the destination table (filename, filemodtime, rownumber) andensures only new/updated files are processed while skipping files that have already been replicated.

DougN Nouria · Answer

Awesome, thank you so much, I will check this out!

Reply

Connect With Us

Connect With Us

Reply

Sign up

CData Community

Scanning file for viruses.

This file cannot be downloaded

Connect With Us

Connect With Us