Performance Optimization: Use Static Lookups Instead of Joins

Question

Hi!

I wanted to share a powerful and lightweight approach I recently implemented for using static reference data in pipelines, without needing memory-intensive joins or separate file/database reads during runtime.

The Challenge

In typical scenarios, we handle static or reference data (like lookup tables or code descriptions) by:

- Reading it from a file or database
- Performing a Join Snap to enrich the main data stream

While effective, joins:

- Can be memory-heavy, especially with large datasets
- Add complexity to your pipeline
- Require both sources to be aligned in structure and timing

The New Approach

Instead of performing a join, we can:

1. Store static reference data as a JSON file in SnapLogic’s SLDB
2. Load this JSON file in an Expression Library
3. Use a filter/map function in your pipeline expressions to fetch data from the JSON based on a key

No joins. No file readers. Just fast in-memory lookups!

Example

Sample JSON file (staticData.json):

[
  { "code": "A1", "desc": "Alpha" },
  { "code": "B2", "desc": "Beta" },
  { "code": "C3", "desc": "Gamma" }
]

Define in Pipeline:

Usage in Pipeline:

lib.static.filter(x => x.code == $code_from_source).length > 0 ? lib.static.filter(x => x.code == $code_from_source)[0].desc : "Unknown"

This setup allows you to quickly enrich your data using a simple expression, and the same logic can be reused across multiple pipelines via the library.

Benefits

- Faster: No join processing overhead
- Simpler pipelines: Fewer snaps and data dependencies
- Reusable: One JSON file + one function = many pipelines
- Memory-efficient: Especially helpful when Snaplex memory is a constraint

Things to Consider

- SLDB file size limit: The JSON file stored in SLDB must be under 100MB (SnapLogic’s file size limit for SLDB uploads).
- Data updates: If your reference data changes frequently (e.g., weekly/monthly), you’ll need to build a separate job or pipeline to overwrite the SLDB file.
- Search performance: The filter() method checks each item one by one, which can be slow if your JSON has a lot of records. For faster lookups, consider converting the data into a key-value map.
- Governance: SLDB files have limited access control compared to databases. Ensure your team is aligned on ownership and update responsibility.
- Maintainability: JSON logic is hardcoded, so changes to structure or logic require modifying the expression library and possibly redeploying affected pipelines.

I’ve found this approach especially useful for small to medium-sized static datasets where performance, simplicity, and reusability are key. If you're looking to reduce joins and streamline your pipelines, I highly recommend giving this method a try.

To make it easier, I’ve attached a sample pipeline, JSON lookup file, and input CSV so you can see the setup in action. Feel free to explore, adapt, and let me know how it works for you!
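
To illustrate the key-value map idea from the search performance note, here is a minimal sketch in plain TypeScript. It is not SnapLogic expression syntax and does not touch SLDB or an expression library; it only shows the difference between scanning the array with filter() and building a map once. The names StaticRecord, describeByFilter, buildLookup, and describeByMap are hypothetical and used for illustration only.

```typescript
// Hypothetical shape of one record in the sample JSON file.
interface StaticRecord {
  code: string;
  desc: string;
}

// Same sample data as in the post.
const staticData: StaticRecord[] = [
  { code: "A1", desc: "Alpha" },
  { code: "B2", desc: "Beta" },
  { code: "C3", desc: "Gamma" },
];

// Linear scan, mirroring the filter() expression from the post.
// Every lookup walks the whole array.
function describeByFilter(code: string): string {
  const matches = staticData.filter((x) => x.code === code);
  return matches.length > 0 ? matches[0].desc : "Unknown";
}

// Build the key-value map once (hypothetical helper). If a code appears
// more than once, the last record wins.
function buildLookup(records: StaticRecord[]): Map<string, string> {
  const lookup = new Map<string, string>();
  for (const r of records) {
    lookup.set(r.code, r.desc);
  }
  return lookup;
}

const lookup = buildLookup(staticData);

// Keyed access: no scan over the records per lookup.
function describeByMap(code: string): string {
  const desc = lookup.get(code);
  return desc !== undefined ? desc : "Unknown";
}
```

With the sample data, both functions return the same description for a known code and "Unknown" for a missing one. The map version pays the cost of one pass over the records up front, which tends to matter only once the reference file grows beyond a handful of entries.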

ptaylor · Answer

Very good post! Thank you.
