Incorrect results when using last_value()

Question

Here is the table: id region variety prices 1 Alexander Valley Cabernet Sauvignon 352 Alexander Valley Cabernet Sauvignon 453 Alexander Valley Merlot 194 California Sauvignon Blanc 85 California Pinot Noir 17 I want to find out the cheapest and most expensive varieties for each region, so the output should Are: Region Expensive Alexander Valley Cabernet Sauvignon Merlot California Pinot Noir Sauvignon Blanc I was able to get the correct results using two first_value() SELECTDISTINCTregion,FIRST_VALUE(variety)OVER(PARTI

P粉253800312 · Answer

The default window for

FIRST_VALUE and LAST_VALUE is ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW. ie. This is the first response. The last value is "so far".

However, you want it to apply to the entire dataset, so you must explicitly describe the window range:

SELECT DISTINCT
  region,
  FIRST_VALUE(variety) OVER 
    (PARTITION BY region ORDER BY price DESC
     ROWS BETWEEN UNBOUNDED PRECEDING AND UNBOUNDED FOLLOWING) AS expensive,
  LAST_VALUE(variety) OVER 
     (PARTITION BY region ORDER BY price DESC
      ROWS BETWEEN UNBOUNDED PRECEDING AND UNBOUNDED FOLLOWING) AS cheapest
FROM wine_list;

id	area	Variety	price
1	Alexander Valley	Cabernet Sauvignon	35
2	Alexander Valley	Cabernet Sauvignon	45
3	Alexander Valley	Merlot	19
4	California	Sauvignon Blanc	8
5	California	Pinot Noir	17