menu
Data

Datasets

Datasets represent a collection of timeseries which share a common schema, owner, and are generally subject to consistent entitlements. The easiest way to learn how to interact with datasets is to use the WEATHER example, which contains weather information for several major US cities from the National Weather Service. This dataset is available to all applications and does not need to be requested explicitly.

You can view and request available datasets from the Marquee Data Catalog.

Dataset Objects

All our datasets are either time-based (intraday) or date-based (daily). Datasets return fields that are categorized as either measures or dimensions:

  • Measures are facts that are usually quantities and that can be aggregated, such as tickers and exchanges
  • Dimensions describe or provide context to measures, like closing prices and volumes

If the data is a curve or surface, then the axes will usually be dimensions.

Date-based Datasets

Date-based datasets have date listed as a field in their data description. The following parameters are valid for queries to date-based datasets:

  • startDate (optional) - defaults to the end date minus a dataset-specific interval, which is currently 30 days
  • endDate (optional) - defaults to the current date
  • dates (optional) - used to return data from a specific set of dates

Time-based Datasets

Time-based datasets have time and updateTime listed as a field in their data description page. The following parameters are valid for queries to time-based datasets:

  • startTime (optional) - defaults to the end time minus a dataset-specific
  • interval - currently 24 hours for all real-time datasets
  • endTime (optional) - defaults to the current datetime
  • times (optional) - used to return data from a specific set of date times

Using Intervals

Time-based datasets can have very large number of observations over a given window (thousands per second). In order to interact with this data, our APIs provide the ability to down-sample data on our servers over given intervals. The intervals parameter allow you to get data which is evenly distributed in the specified number of intervals between the time or date range specified. Example:

from gs_quant.data import Dataset
from datetime import date

weather_ds = Dataset('WEATHER')
data_frame = weather_ds.get_data(date(2016, 1, 1),  date(2016, 1, 31), city=["Boston"], intervals=3)

print(data_frame)

Output:

     city        date  dewPoint  ...  snowfall                updateTime  windSpeed
0  Boston  2016-01-11      12.0  ...       0.0  2017-03-06T16:49:36.472Z       19.0
1  Boston  2016-01-21       7.0  ...       0.0  2017-03-06T16:49:36.473Z       13.6
2  Boston  2016-01-31      29.0  ...       0.0  2017-03-06T16:49:36.476Z       11.6

[3 rows x 10 columns]
info

Note

API responses are limited to approximately 100MB. If you receive a 400 Bad Request exception with the message "Number of rows returned... are more than maximum allowed", batch your query down into multiple, smaller queries.

Consider using smaller date / time ranges (adjust startTime and endTime or startDate and endDate as needed) or querying for fewer entities (e.g. asset ids, reports) each time.

Field Selection

If you want to ensure the response only contains the fields that you are interested in, you can use the fields parameters. In the weather dataset, say that you are only interested in maxTemperature and minTemperature. To only return these two fields, pass in the desired fields as arguments. Example:

from gs_quant.data import Dataset
from datetime import date

weather_ds = Dataset('WEATHER')
data_frame = weather_ds.get_data(date(2016, 1, 1), date(2016, 1, 2), city=["Boston"], fields=['maxTemperature', 'minTemperature'])

print(data_frame)

Output:

     city        date  maxTemperature  minTemperature
0  Boston  2016-01-01            41.0            33.0
1  Boston  2016-01-02            40.0            31.0

Related Content


This site is for informational purposes only and does not constitute an offer to sell, or the solicitation of an offer to buy, any security. The Goldman Sachs Marquee® platform is for institutional and professional clients only. Some of the services and products described on this site may not be available in certain jurisdictions or to certain types of client. Please contact your Goldman Sachs sales representative with any questions. Nothing on this site constitutes an offer, or an invitation to make an offer from Goldman Sachs to purchase or sell a product. This site is given for purely indicative purposes and does not create any contractual relationship between you and Goldman Sachs. Any market information contained on the site (including but not limited to pricing levels) is based on data available to Goldman Sachs at a given moment and may change from time to time. There is no representation that any transaction can or could have been effected on such terms or at such prices. Please see https://www.goldmansachs.com/disclaimer/sec-div-disclaimers-for-electronic-comms.html for additional information. © 2023 Goldman Sachs. All rights reserved.
Transaction Banking services are offered by Goldman Sachs Bank USA (“GS Bank”). GS Bank is a New York State chartered bank, a member of the Federal Reserve System and a Member FDIC. © 2023 Goldman Sachs. All rights reserved.
Not all products and functionality mentioned on this website are currently available through our API platform.
All loans and deposit products are provided by Goldman Sachs Bank USA, Salt Lake City Branch. Member FDIC.
Brokerage and investment advisory services offered by our investment products are provided by Goldman Sachs & Co. LLC (`‘GS&CO.`’), which is an SEC registered broker-dealer and investment adviser, and member FINRA/SIPC. Research our firm at FINRA's BrokerCheck. Custody and clearing services are provided by Apex Clearing Corporation, a registered broker-dealer and member FINRA/SIPC. Please consider your objectives before investing. A diversified portfolio does not ensure a profit or protect against a loss. Past performance does not guarantee future results. Investment outcomes and projections are forward-looking statements and hypothetical in nature. Neither this website nor any of its contents shall constitute an offer, solicitation, or advice to buy or sell securities in any jurisdictions where GS&Co. is not registered. Any information provided prior to opening an investment account is on the basis that it will not constitute investment advice and that GS&Co. is not a fiduciary to any person by reason of providing such information. For more information about our investment offerings, visit our Full Disclosures.
Investment products are: NOT FDIC INSURED ∙ NOT A DEPOSIT OR OTHER OBLIGATION OF, OR GUARANTEED BY, GOLDMAN SACHS BANK USA ∙ SUBJECT TO INVESTMENT RISKS, INCLUDING POSSIBLE LOSS OF THE PRINCIPAL AMOUNT INVESTED