Methodology

How SigFact builds a two-way intelligence bridge across language barriers

Official-source monitoring, structured extraction, and editorial verification — turning fragmented local-language company announcements into decision-ready intelligence that flows both ways across the language barrier.

Why Language Barriers Create Information Gaps

Most competitive intelligence tools monitor English-language sources. But the majority of company announcements in Asia are published first — and sometimes only — in local languages: Japanese press releases, Korean DART filings, Chinese regulatory disclosures, Indonesian exchange announcements.

This creates a structural information gap: English-only monitoring misses signals that are visible to local-market participants but invisible to global teams. By the time these signals appear in English media, the information advantage has already dissipated.

English-Only Monitoring

  • Misses local-language press releases
  • Delayed coverage (days to weeks)
  • Filtered through third-party interpretation
  • No access to regulatory filings in local languages

Cross-Language Monitoring

  • Captures signals at the source
  • Same-day or next-day coverage
  • Direct from official company channels
  • Full access to local regulatory filings

Why Local-Language Sources Matter

SigFact monitors official company sources in 6 languages: Japanese, Korean, Chinese, Hindi, Bahasa Indonesia, and Thai. These are not translations — they are the original announcement channels where companies publish first.

JA
JapaneseTSE filings, company newsrooms, EDINET

Sony IR page publishes earnings in Japanese hours before English translation

KO
KoreanDART filings, KRX announcements, company IR

Samsung DART filings contain details not in English press releases

ZH
ChineseCSRC, SSE/SZSE filings, company newsrooms

BYD CSRC filings and SZSE announcements in Mandarin only

HI
HindiBSE/NSE filings, company newsrooms

Infosys BSE filings with regional market context

ID
Bahasa IndonesiaIDX filings, OJK disclosures

GoTo IDX filings published in Bahasa before English

TH
ThaiSET filings, company pages

SCB SET filings and Thai-language press releases

ZH-TW
Traditional Chinese (Taiwan)TWSE filings, MOPS disclosures, company newsrooms

TSMC TWSE filings and Foxconn announcements published in Traditional Chinese before English

Canonical Sources Only

SigFact classifies sources into four layers. Every published signal must be traceable to a Layer 1-3 canonical source. Discovery inputs are used to identify signals, then verified against canonical sources before publication.

Layer 1 — Direct Company Sourcescanonical

Company newsrooms, official websites, press release pages

Layer 2 — Investor & Regulatorycanonical

IR pages, stock exchange filings (TSE, KRX, TWSE, SSE/SZSE), regulatory disclosures, annual reports

Layer 3 — Canonical Wire Distributioncanonical

PR Newswire, Business Wire, GlobeNewswire

Layer 4 — Discovery with Verificationdiscovery

Official LinkedIn, regional social accounts, industry databases — used to discover signals, then verified against Layer 1-3 sources

Key principle: Discovery inputs are used to identify signals, which are then verified against canonical sources before publication. No signal is published based on discovery inputs alone.

Signal Normalization Across Languages

A Japanese press release and a Korean DART filing may describe similar events using completely different terminology and structure. SigFact normalizes signals from all source languages into a consistent schema:

Normalized Signal Schema
Event TypeStandardized taxonomy (Product Launch, Partnership, M&A, etc.)
CompanyMapped to unified company entity with cross-language aliases
ChannelClassified into 11 channels across 8 industry and 3 regional themes
RegionGeographic relevance tagging (12-region taxonomy)
Source ClassCanonical (Layer 1-3) or Discovery
Source TrustSource trust label: Official / Verified / Wire / Discovery / Reviewed
SummaryStructured multilingual summary (EN + ZH + JA + KO + ZH-TW)
Key Takeaways3-5 actionable takeaways per signal

This normalization means you can compare a Samsung DART filing with a TSMC TWSE announcement or a Sony TSE disclosure in the same structured format — regardless of the original language.

From Source to Signal to Brief

Every signal goes through a structured pipeline combining AI extraction with human verification:

01

Source Monitoring

Automated monitoring of official company sources in 6 languages. New announcements are detected and queued for extraction.

02

Structured Extraction

AI-powered extraction identifies event type, key entities, financial data, and geographic relevance from the original-language source.

03

Source Trust & Classification

Each signal is assigned a Source Trust label (Official, Verified, Wire, Discovery, Reviewed) based on source tier. Signals are classified by channel and event taxonomy.

04

Editorial Verification

Human editors verify extraction accuracy against canonical sources. Key claims are cross-checked before publication.

05

Normalization & Delivery

Verified signals are normalized into the standard schema, multilingual summaries are generated in 5 languages, and signals are delivered via briefs, digests, and the signal database.

AI-Augmented, Human-Verified

AI handles extraction, scoring, and draft generation. Every published signal is verified against canonical sources by our editorial process. This is not automated news aggregation — it is structured intelligence with a verification layer.

What We Track

Included

  • Funding and investment announcements
  • Strategic partnerships and alliances
  • Product launches and platform updates
  • Market expansion and geographic growth
  • Executive appointments and leadership changes
  • Earnings and financial milestones
  • R&D breakthroughs and patent filings
  • Regulatory actions and compliance updates

Excluded

  • Blog posts and opinion pieces
  • Generic marketing content
  • Event promotions and sponsorships
  • Unrelated media coverage
  • Commentary not tied to official announcements
  • Unofficial rumors or social media speculation

Summary & Delivery Principles

All summaries are generated with a focus on accuracy and utility. They are neutral, concise, and structured — based directly on the original source material.

SigFact delivers intelligence in 5 languages — English, Chinese, Japanese, Korean, and Traditional Chinese. This multilingual delivery is a strategic design decision for cross-market professionals, ensuring teams across the region access the same structured intelligence.

Users should always refer to the original source for full wording, context, and official details. SigFact provides structured summaries for professional convenience, but the original company announcement remains the authoritative source.

See the Methodology in Action

Browse our signal database to see how cross-language source monitoring produces structured, decision-ready intelligence.