Build a Privacy‑Preserving Restaurant Recommender Microservice (Maps + Local ML)
Build a privacy-preserving restaurant recommender: use tile-based maps APIs and on-device ML to rank restaurants without sending PII to third parties.
Stop leaking preferences to third parties: a fast, practical architecture
Decision fatigue, fragmented APIs, and privacy anxiety are real—especially for students and makers building dining apps or small microservices. You want recommendations that feel personal, but you don't want to hand your location, eating habits, or activity traces to big analytics vendors. In 2026, the best pattern is simple: fetch maps data from trusted sources, keep user preferences on the device, and run ranking locally. This article gives you a complete microservice API design, backend implementation notes, and runnable client code that combines maps data with on-device preference scoring—so personal data never leaves the user's phone.
Why this matters in 2026 (trends and context)
Two big trends make this architecture timely:
- On-device ML and edge compute are mainstream. Mobile NPUs and frameworks like Core ML, TensorFlow Lite, ONNX Runtime Mobile, and emerging WebNN support performant local inference even for small models. Devices in 2026 routinely run models for user personalization without server roundtrips.
- Privacy-by-design and data minimization are expected. Regulatory pressure (GDPR variants, CPRA, and new EU AI Act enforcement patterns) plus platform changes (app-scoped IDs, privacy sandboxes) push developers to minimize PII sent to external APIs. Aggregated analytics backends such as ClickHouse-powered stacks are popular for retaining aggregated metrics without exposing raw PII.
Recommendation: design for the device to own the profile. The server supplies public POIs and lightweight signals; the device keeps the user model and does the final ranking.
High-level architecture
Here's the recommended architecture, described in layers so you can implement incrementally:
- Maps data sync (server): Regularly ingest POI metadata from a maps provider (Google Places, OpenStreetMap, HERE, Mapbox). Store POIs in a spatial index or OLAP store for fast tile queries.
- Tile microservice (server): Expose a tile- or geohash-based API that returns anonymized POI lists for a small area. The client requests tiles by ID (not raw coordinates) to minimize location precision leakage.
- Local scorer (client): A small model or scoring function lives on the device—either as a lightweight JSON weights file or a tiny TF Lite / ONNX model. The client fetches tile POIs and ranks them locally using the model and local signals (current activity, explicit likes, dietary flags).
- Optional federated updates (privacy-preserving): Use federated learning or aggregated differential privacy to push model updates without sending raw user data back to the server.
API design: tile-first, privacy-focused
Design the backend API so it never needs exact user coordinates or a persistent user identifier. Use tile keys (geohash or S2 cell ID) and context-only filters.
Key endpoints
- GET /v1/tiles/{tileId} — returns POIs for a tile (geohash or S2)
- GET /v1/places/{placeId} — detailed POI info (minimal fields)
- GET /v1/context-factors — optional non-personal signals (time buckets, aggregated popularity, anonymized wait estimates)
- GET /v1/models/latest — fetch the latest public model or weight file (no user data included)
Tile API JSON shape (example)
Return a minimal, typed payload that contains only the data needed for ranking and display.
{
  "tileId": "9q9hv",
  "bounds": { "ne": [37.79, -122.39], "sw": [37.77, -122.41] },
  "places": [
    {
      "id": "place_2043",
      "name": "La Cocina Tacos",
      "lat": 37.7831,
      "lng": -122.3975,
      "category": "Mexican",
      "price_level": 2,
      "rating": 4.5,
      "tags": ["tacos", "casual", "outdoor_seating"]
    }
  ],
  "fetched_at": "2026-01-05T12:00:00Z"
}
Design notes: keep fields compact. Avoid returning internal popularity by user identifiers. If you compute popularity, use aggregated counts with noise (differential privacy) or top-10 trending flags only.
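If you do publish popularity, the Laplace mechanism is the simplest way to add that noise. A minimal sketch (the `noisyCount` helper and its epsilon default are illustrative, not part of any library):

```javascript
// Sketch: add Laplace noise to an aggregated visit count before publishing it.
// Sensitivity 1 assumes each user contributes at most one visit to the count.
function laplaceNoise(scale) {
  const u = Math.random() - 0.5; // uniform in [-0.5, 0.5)
  // Inverse-CDF sampling of the Laplace distribution
  return -scale * Math.sign(u) * Math.log(1 - 2 * Math.abs(u));
}

function noisyCount(trueCount, epsilon = 1.0) {
  // Laplace mechanism: smaller epsilon => more noise, more privacy
  return Math.max(0, Math.round(trueCount + laplaceNoise(1 / epsilon)));
}
```

Publish only the noisy counts (or coarse "trending" flags derived from them); never the raw per-tile tallies.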
Server-side implementation: Node.js microservice example
This Express example implements a tile endpoint that reads POIs from a pre-populated store (could be ClickHouse, PostGIS, or a simple file during prototyping). The service never receives raw user coordinates; the client computes tileId locally.
// server/app.js (Node.js + Express)
const express = require('express');
const NodeCache = require('node-cache');
const POI_STORE = require('./poi_store'); // abstracted store
const app = express();
const tileCache = new NodeCache({ stdTTL: 60 }); // cache tiles for 60s

app.get('/v1/tiles/:tileId', async (req, res) => {
  const tileId = req.params.tileId;

  // Rate limit and simple auth (API key)
  const key = req.header('x-api-key');
  if (!key || key !== process.env.SERVICE_KEY) {
    return res.status(401).send({ error: 'unauthorized' });
  }

  // Serve from cache when possible
  const cached = tileCache.get(tileId);
  if (cached) return res.json(cached);

  try {
    // Query POIs from the store by tileId
    const places = await POI_STORE.getByTile(tileId);

    // Minimal shaping: expose only the fields the client needs
    const payload = {
      tileId,
      bounds: POI_STORE.tileBounds(tileId),
      places: places.map(p => ({
        id: p.id,
        name: p.name,
        lat: p.lat,
        lng: p.lng,
        category: p.category,
        price_level: p.price_level,
        rating: p.rating,
        tags: p.tags
      })),
      fetched_at: new Date().toISOString()
    };

    tileCache.set(tileId, payload);
    res.json(payload);
  } catch (err) {
    // Express 4 does not catch async errors automatically
    res.status(500).send({ error: 'store_unavailable' });
  }
});

app.listen(3000, () => console.log('Tile microservice listening on 3000'));
Notes:
- POI_STORE might be a ClickHouse or PostGIS backed service. ClickHouse is increasingly used for aggregated analytics and can support large-scale tile slicing when combined with spatial indices.
- Do not log raw API requests containing potentially sensitive headers. Rotate SERVICE_KEY regularly and issue per-device ephemeral keys if you need throttling.
Client-side: compute tile and rank locally
The client computes its tile ID from device location using geohash or S2. It then fetches the tile and runs a local scorer. The local scorer keeps the user profile (likes, dislikes, dietary flags) and applies a model to score each place.
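If you'd rather avoid a dependency, a geohash encoder is only a few lines. A minimal sketch (real apps might use an existing geohash or S2 library instead):

```javascript
// Sketch: minimal geohash encoder. Alternates longitude/latitude bits,
// packing 5 bits per base-32 character.
const BASE32 = '0123456789bcdefghjkmnpqrstuvwxyz';

function geohashEncode(lat, lng, precision = 7) {
  const latRange = [-90, 90];
  const lngRange = [-180, 180];
  let hash = '', bit = 0, ch = 0, evenBit = true; // even bits encode longitude

  while (hash.length < precision) {
    const range = evenBit ? lngRange : latRange;
    const value = evenBit ? lng : lat;
    const mid = (range[0] + range[1]) / 2;
    if (value >= mid) { ch = (ch << 1) | 1; range[0] = mid; }
    else { ch = ch << 1; range[1] = mid; }
    evenBit = !evenBit;
    if (++bit === 5) { hash += BASE32[ch]; bit = 0; ch = 0; }
  }
  return hash;
}
```

The precision argument is the privacy dial: each extra character shrinks the tile by a factor of 32, so choose it deliberately (see the safeguards section below).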
Simple client scorer (JavaScript, no external inference engines)
This example uses a small weighted scoring function stored on-device. It's explainable, easy to update, and runs instantly. For more advanced use, swap in a TF Lite or ONNX model and use native inference.
// client/scorer.js
// userProfile stored only on device
const userProfile = {
  prefers: { Mexican: 0.8, Japanese: 0.2 },
  dislikes: { fast_food: 1.0 },
  max_price_level: 2,
  distance_weight: 0.6,
  rating_weight: 0.4
};

function scorePlace(place, userLocation) {
  // 1) Cuisine match (small floor so unknown cuisines aren't zeroed out)
  const cuisineScore = userProfile.prefers[place.category] || 0.1;
  // 2) Price filter
  const priceScore = place.price_level <= userProfile.max_price_level ? 1 : 0.2;
  // 3) Rating, normalized to [0, 1]
  const ratingScore = (place.rating || 3) / 5;
  // 4) Distance (simple haversine); prefer places within 5 km
  const distanceKm = haversineKm(userLocation, { lat: place.lat, lng: place.lng });
  const distanceScore = Math.max(0, 1 - distanceKm / 5);
  // 5) Penalty for disliked tags (e.g. fast_food); 0.5 weight is illustrative
  const dislikePenalty = (place.tags || [])
    .reduce((sum, tag) => sum + (userProfile.dislikes[tag] || 0), 0);
  // Weighted sum
  const score =
    cuisineScore * 0.4 +
    priceScore * 0.1 +
    ratingScore * userProfile.rating_weight +
    distanceScore * userProfile.distance_weight -
    dislikePenalty * 0.5;
  return { placeId: place.id, score, distanceKm };
}

function haversineKm(a, b) {
  const R = 6371; // Earth radius in km
  const toRad = x => (x * Math.PI) / 180;
  const dLat = toRad(b.lat - a.lat);
  const dLon = toRad(b.lng - a.lng);
  const lat1 = toRad(a.lat);
  const lat2 = toRad(b.lat);
  const sinDLat = Math.sin(dLat / 2);
  const sinDLon = Math.sin(dLon / 2);
  const c = 2 * Math.asin(
    Math.sqrt(sinDLat * sinDLat + Math.cos(lat1) * Math.cos(lat2) * sinDLon * sinDLon)
  );
  return R * c;
}

export { scorePlace };
Because scoring happens locally, you never send the user's preferences or exact position to the server; only the tile ID (geohash) is ever transmitted.
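Putting the pieces together, ranking a tile is just map-and-sort. A sketch with a stand-in distance-only scorer (`demoScore` and `rankPlaces` are illustrative names; in the app you would pass `scorePlace` from the scorer module above):

```javascript
// Sketch: rank a tile's places with a pluggable scoring function.
function rankPlaces(places, userLocation, scoreFn) {
  return places
    .map(p => ({ ...p, ...scoreFn(p, userLocation) })) // attach score fields
    .sort((a, b) => b.score - a.score);                // best first
}

// Stand-in scorer for the demo: closer places win.
const demoScore = (p, loc) => {
  const d = Math.hypot(p.lat - loc.lat, p.lng - loc.lng);
  return { score: 1 / (1 + d) };
};

const ranked = rankPlaces(
  [
    { id: 'a', lat: 37.80, lng: -122.40 },
    { id: 'b', lat: 37.78, lng: -122.40 }
  ],
  { lat: 37.78, lng: -122.40 },
  demoScore
);
// ranked[0].id === 'b' (closest place first)
```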
Alternative: a tiny TF Lite model
If you prefer a trained model, train on anonymized, aggregated data and export a small model (few KBs to a few MBs). Distribute via /v1/models/latest. On mobile, use Core ML or TF Lite; on web, use TF.js or ONNX.js. Keep model inputs strictly limited to non-PII features (place attributes, time-of-day bucket, local device signals like step count if allowed).
Protecting privacy end-to-end
Here are practical safeguards you should adopt.
- Tile-based requests: the client sends a geohash or S2 cell ID instead of precise coords. Choose a precision that balances utility vs. privacy (e.g., geohash length 6-7 for city-block granularity).
- Minimize fields: only include fields the client needs to rank and display. Avoid unique tracking tokens or direct links to third-party analytics.
- Ephemeral keys: use short-lived service tokens for rate limiting. Do not attach device identifiers.
- Aggregation with noise: publish popularity or wait-time estimates using differential privacy techniques to prevent de-anonymization when reporting statistics.
- Local storage security: store user weights and preferences in secure storage (Keychain / Keystore / secure local DB). Encrypt at rest. Avoid syncing preferences to cloud backups unless the user explicitly opts in.
Training and updating the model without raw data
Two practical strategies:
- Federated learning — devices compute gradients locally and send encrypted, aggregated updates to the server; the server aggregates updates and produces a new global model. This pattern reduces raw data exposure.
- Server-side aggregated training — store only anonymized, aggregated counters in an OLAP store (ClickHouse or BigQuery). Train models on these aggregates and publish distilled models for the client.
In 2025–2026, federated learning toolchains matured and some libraries provide end-to-end pipelines. For small teams, start with aggregated training and add federated components later.
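The server-side aggregation step of federated averaging can be sketched in a few lines (plain JSON weight vectors, weighted by each device's example count; real deployments add secure aggregation, clipping, and noise):

```javascript
// Sketch: FedAvg-style aggregation of weight updates.
// updates: array of { weights: number[], count: number } from devices.
function federatedAverage(updates) {
  const total = updates.reduce((sum, u) => sum + u.count, 0);
  const dim = updates[0].weights.length;
  const avg = new Array(dim).fill(0);
  for (const u of updates) {
    const share = u.count / total; // weight by local example count
    for (let i = 0; i < dim; i++) avg[i] += u.weights[i] * share;
  }
  return avg;
}
```

The server only ever sees weight deltas, never the raw preference data that produced them.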
UX considerations and hybrid features
You can still provide useful features without sacrificing privacy:
- Local filters: dietary tags, price caps, and explicit likes are kept on-device and can be toggled instantly.
- Contextual signals from server: time-of-day, weather (non-user-specific), and city-level popularity can be provided by the server to help ranking without user PII.
- Shared decision mode: for multi-user group apps, use a peer-to-peer handshake (local network/Bluetooth) to share ephemeral preferences for a session—no backend required.
Operational practices
Run the microservice with typical production hardening:
- Use TLS everywhere; HSTS for the API domain. Consider multi-cloud strategies to avoid single-vendor outages.
- Audit logs but never log PII or location strings. Keep request logs hashed and purged regularly.
- Rate limit per API key and issue scoped keys for partners.
- Monitor model size and inference latency on common devices; prefer smaller models for older phones. Build timing analysis into your CI/CD pipeline so latency regressions are caught before release.
Example: end-to-end flow (user story)
Imagine Ana opens your dining micro-app at lunchtime:
- The app truncates Ana's GPS fix to a geohash tile locally (here precision 5)—no network call yet.
- The app requests GET /v1/tiles/9q9hv from your microservice. The server responds with POI metadata for that tile.
- The app loads the local model weights (already cached) and scores each POI against Ana's on-device profile (her saved dietary tags and cuisine preferences).
- The local UI shows the ranked list. If Ana taps “share” to ask friends, the app uses a local peer-to-peer invite over Bluetooth to share the session—no server persists Ana's profile or exact location.
Sample production checklist
- Define tile precision and document privacy trade-offs.
- Make POI sync jobs resilient and idempotent; schedule periodic updates from maps provider.
- Implement caching and ETag/If-Modified-Since to reduce bandwidth and costs.
- Publish a privacy schema and show users what stays on device.
- Run a small-scale federated pilot if needed; otherwise, rely on aggregated analytics.
Advanced strategies and future directions (2026+)
Look ahead to these advanced techniques if you're building for scale or tighter personalization needs:
- On-device personalization ensembles: combine a simple linear scorer with a small neural reranker for candidate refinement.
- WebNN / WASM inference: run compact models in the browser without native dependencies for PWA use-cases (WebNN and WASM runtimes are maturing).
- Server-side candidate pruning: use cheap filters (open/closed states, price ranges) server-side to reduce tile payload size; leave personalization to the client.
- Privacy-preserving analytics: publish aggregate insights using ClickHouse or similar OLAP backends with differential privacy noise injection for product metrics (instrument and audit carefully).
Pitfalls to avoid
- Sending raw device coordinates to third-party SaaS for ranking (you lose control of PII).
- Exposing persistent device IDs or app-scoped IDs in telemetry without consent.
- Overfitting on population-level popularity signals—local tastes matter and should be captured on-device.
Quick start checklist (build in one afternoon)
- Pick a maps data source (OpenStreetMap for an open stack; Google/HERE/Mapbox if you need richer POI metadata).
- Implement a POI ingest job to a simple Postgres/PostGIS table keyed by geohash.
- Build the Node.js tile microservice (sample code above).
- Create a tiny client scoring module (JavaScript) and store user preferences on-device.
- Ship a minimal UI that fetches tiles and ranks locally.
Actionable takeaways
- Keep the profile local. The simplest way to preserve privacy is to never transmit user preferences or precise coordinates off the device.
- Use tile-based queries. Requesting geohash tiles instead of coordinates reduces risk and meets many privacy regulations.
- Distribute tiny models. Publish small, curated models or weight files for local scoring and update them with aggregated or federated pipelines; sign or checksum model downloads so clients can verify them.
- Audit and document. Make it explicit to users what data is kept locally and what your server processes.
Resources and libs (2026)
- Core ML, TensorFlow Lite, ONNX Runtime Mobile for on-device inference
- Geohash and S2 libraries (client and server)
- ClickHouse for aggregated analytics and OLAP use-cases
- Privacy libraries: OpenDP (differential privacy) and TensorFlow Federated (federated experiments)
Conclusion & call to action
Privacy-preserving recommendations are achievable and practical in 2026. By combining tile-based maps data, a small microservice to serve POI tiles, and on-device scoring, you can build a restaurant recommender that respects user privacy, stays fast, and scales. Start with a simple linear scorer in the client and a tile microservice—iterate to federated or distilled models as you grow.
Ready to build? Clone the starter repo, deploy the tile microservice in a free tier, and run the local scorer on a test phone. Share your results, and we’ll highlight strong privacy-first projects in our next tutorial series.