File types indexable by Google

Google can index the content of most text-based files and certain encoded document formats. The most common file types we index include:

  • Adobe Portable Document Format (.pdf)
  • Adobe PostScript (.ps)
  • Comma-Separated Values (.csv)
  • Electronic Publication (.epub)
  • Google Earth (.kml, .kmz)
  • GPS eXchange Format (.gpx)
  • Hancom Hanword (.hwp)
  • HTML (.htm, .html, other file extensions)
  • Microsoft Excel (.xls, .xlsx)
  • Microsoft PowerPoint (.ppt, .pptx)
  • Microsoft Word (.doc, .docx)
  • OpenOffice presentation (.odp)
  • OpenOffice spreadsheet (.ods)
  • OpenOffice text (.odt)
  • Rich Text Format (.rtf)
  • Scalable Vector Graphics (.svg)
  • TeX/LaTeX (.tex)
  • Text (.txt, .text, other file extensions), including source code in common programming languages, such as:
    • Basic source code (.bas)
    • C/C++ source code (.c, .cc, .cpp, .cxx, .h, .hpp)
    • C# source code (.cs)
    • Java source code (.java)
    • Perl source code (.pl)
    • Python source code (.py)
  • Wireless Markup Language (.wml, .wap)
  • XML (.xml)

Google can also index the following media formats:

  • Image formats: BMP, GIF, JPEG, PNG, WebP, and SVG
  • Video formats: 3GP, 3G2, ASF, AVI, DivX, M2V, M3U, M3U8, M4V, MKV, MOV, MP4, MPEG, OGV, QVT, RAM, RM, VOB, WebM, WMV, and XAP

Search by file type

You can use the filetype: operator in Google Search to limit results to a specific file type or file extension. For example, filetype:rtf galway will search for RTF files and URLs ending in .rtf whose content contains the term "galway".
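
As an illustration, here is a minimal Python sketch of how such a query might be assembled programmatically. The helper names filetype_query and search_url are hypothetical, and the https://www.google.com/search?q= URL pattern is an assumption made for this example rather than something this page documents.

```python
from urllib.parse import urlencode


def filetype_query(extension: str, terms: str) -> str:
    """Build a query string that uses the filetype: operator.

    Hypothetical helper: accepts the extension with or without a
    leading dot and normalizes it to lowercase.
    """
    ext = extension.strip().lstrip(".").lower()
    return f"filetype:{ext} {terms.strip()}"


def search_url(extension: str, terms: str) -> str:
    """Return a search URL for the query.

    Assumption: the query is passed in the `q` parameter of
    https://www.google.com/search.
    """
    query = filetype_query(extension, terms)
    return "https://www.google.com/search?" + urlencode({"q": query})


print(filetype_query(".rtf", "galway"))  # filetype:rtf galway
print(search_url("rtf", "galway"))
```

Note that there is no space between filetype: and the extension; the search terms follow after a single space.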